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Abstract 

On the Isomorphism Problem of p-Endomorphisms 
Peter Jong, Ph.D. 
Department of Mathematics, University of Toronto, 2003 

Let X = {X, B, ji, T) be a measure-preserving system on a Lebesgue probabihty space. 
Given a fixed probabihty vector p = we say that X = {X,B,ii,T) is a p- 

endomorphism if T is s-to-1 a.e. and the conditional probabihties of the preimages are 
precisely the components of p. Two measure-preserving systems X = {X, B, ji, T) and Y = 
(Y, C, V, S) are isomorphic if there exist a measure-preserving bijective map X ^ Y 
such that i-pT = Sip a.e. 

This thesis considers the isomorphism problem of p-cndomorphisms, generalizing the 
work of Hoffman and Rudolph [H,R] which treats the case when p is a uniform probability 
vector, i.e. p = . . . , In particular, we generalize the tvwB criterion introduced in 
Hoffman and Rudolph to prove two results. 

The first result is Theorem 2.4.1, which generalizes the main theorem in [H,R] to p- 
endomorphisms. We paraphrase this as follows: 

Theorem 2.4.1. Let X = iX,B,n,T) be a p-endomorphism. Then X = {X,B,ii,T) is 
one-sided Bernoulli if and only if X = {X, B, /i, T) is tvwB. 

We give two proofs of this result. The first follows Ornstein's classical proof of his 
famous theorem that two shifts of equal entropy are isomorphic, and a second proof which 
follows the joinings proof as given in [H,R]. As a corollary of the joinings proof, we show that 
there are uncountably many automorphisms of the one-sided Bernoulli shift S+(p) unless 
the components of p are pairwise distinct. We also give examples of tvwB p-endomorphisms 
such as mixing one-sided Markov shifts and a generalization of the [T, Id] transformation. 

The second main result is Theorem 5.1.1, which in view of Theorem 2.4.1, reduces to the 
statement that for any two tvwB finite group extensions of one-sided Bernoulli shifts, there 
is an isomorphism between them in a stronger sense than that asserted in Theorem 2.4.1. 
Specifically, we have the following theorem in Chapter 5 which we paraphrase as follows: 

Theorem 5.1.1'. Let G he a finite group. For any two tvwB G-extensions of the one-sided 

shift B^{p), there is an isomorphism which preserves the Bernoulli factor algebra and maps 
Gbres over points in the factor to other such fibres by group rotations. 
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Chapter 1 



Introduction 

1.1. Background 

Let X be a compact metric space and B be its Borel cr-algebra. Suppose {X, B, /x) is a 
nonatomic Lebesgue probability space (a probability measure space isomorphic to the 
unit interval with Lebesgue measure). An endomorphism of X is a measure-preserving map 
T : X ^ X, i.e. for all B E B, fi{T^^B) = 1^(3). For us, a measure-preserving system 
(m.p.s.) is a quadruple {X,B, ii,T) where {X,B,fJ,) is a nonatomic Lebesgue probability 
space defined on a compact metric space X, and T is an endomorphism. 

This thesis is mainly concerned with the isomorphism problem of measure-preserving sys- 
tems. Two measure-preserving systems X = {X, B, ji, T) and Y = (Y, C, v, S) are (measure- 
theoretically) isomorphic if after deleting null sets Xq and Yq from X and Y respectively, 
there is a bijection : X\Xo Y\Y^ such that (j) is measure-preserving and (/)T = S(}) on 
X\Xo. We say that (j) is an isomorphism from X to Y. In case X = Y, we say that (j) is an 
automorphism of X. More generally, a factor map : X ^ Y is a measure-preserving 
map (j) : X ^ Y such that (j)T = Scf) a.e. In this case, we say that Y is a factor of X, and 
X is an extension of Y. 

The classical isomorphism problem in ergodic theory is the classification of two-sided 
Bernoulli shifts. To define a two-sided BernouUi shift, fix a probability vector p = 
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(pi, . . . ,Ps). Consider the finite set / = {1, . . . , s} and a measure m on 7 defined by m(j) = pj. 
Construct the product space I^. Let B denote the Borel sigma- algebra and a denote the 
left-shift transformation defined by a{x)j = Xj+i, where Xt is the t-th coordinate of x. The 
two-sided Bernoulli shift -B(p) is the m.p.s. (/^, B, m^, a). 

The breakthrough in the classification problem of two-sided Bernoulli shifts came when 
Kolmogorov introduced the concept of entropy into ergodic theory. It is not difficult to 
show that the entropy of a m.p.s. is invariant under isomorphism. Moreover, the entropy of 
a Bernoulli shift is easy to compute. Indeed, for a probability vector p = (pi, . . . ,Ps), define 
the entropy of p, denoted /i(p), to be Ylj=i ~Pj^^S2Pj- 1^ can be shown that the entropy of 
the two-sided Bernoulli shift -B(p) is just h{p). It thus follows that the two-sided Bernoulli 
shift on two symbols with equal weights, which has entropy log22, is not isomorphic to the 
two-sided Bernoulli shift on three symbols with equal weights, which has entropy log23. 

The solution of the classification problem was finally achieved around 1970 when Ornstein 
[Orn] showed that entropy is in fact a complete invariant for 2-sided BernouUi shifts; that 
is, two 2-sided Bernoulli shifts are isomorphic if and only if they have the same entropy. 
More generally, it can be shown that entropy is also a complete invariant for two-sided shift 
spaces with finitely determined measures. Since Ornstein's original proof, other criteria 
such as the weeik Bernoulli and the very weak Bernoulli conditions have been developed 
which also turn out to be sufficient for a m.p.s. to be isomorphic to a two-sided Bernoulli 
shift. The reader is referred to Shields [Shi] for an excellent account of the proof of Ornstein's 
theorem as well as a discussion of finitely determined measures. Examples of weak Bernoulli 
and very weak Bernoulli systems such as ergodic toral automorphisms and two-sided mixing 
Markov shifts are discussed in Petersen [Pet] . 

What we will deal with in this thesis is the isomorphism problem in the case when the 
endomorphism T : X ^ X is not invertible. Throughout this thesis, N* will denote the 
set of nonnegative integers. One example of a non- invertible endomorphism is a one-sided 
Bernoulli shift, which is derived from a two-sided Bernoulli shift by restricting the shift 
space to {1, ... , s}^' . Let -B+(p) denote the m.p.s. obtained from -B(p) in this way. In this 
connection, we remark that Ornstein's theorem does not hold for one-sided Bernoulli shifts. 
Indeed, it is easy to construct two probability vectors with different numbers of components 
which have the same entropy. This leads to two 1-sided shifts having the same entropy; 
however, they are not isomorphic because they have distinct numbers of inverse images. 



2 



In a recent paper, Hoffman and Rudolpfi made a fundamental contribution to the isomor- 
phism problem of measure- preserving systems with non-invertible maps. In [H,R], Hoffman 
and Rudolph considered a class of endomorphisms called the uniformly p-to-1 endomor- 
phisms, and introduced a condition called tree very weaik Bernoulli (tvwB) on this 
class. They then proved that the tvwB condition is necessary and sufficient for a uniformly 
p-to-1 endomorphism to be isomorphic to the one-sided Bernoulli shift i?+(p), where p is the 
uniform vector (1/p, . . . , As the authors remarked, entropy turns out to have no role in 
this theory. Perhaps as a result of this, many of the arguments are much simpler than those 
in Ornstein's proof. This thesis will consider generalizations of the tvwB criterion which will 
allow us to prove isomorphism theorems for more general measure-preserving systems with 
non-invertible transformations. 

1.2. Organization and Contributions of Thesis 

Without delving into definitions and details, which will be presented after this section 
and in subsequent chapters, we outline below the contents and main results in each chapter 
of this thesis. 

Chapter 1 is this introduction. In §1.3, we will define the class of objects of interest in 
this thesis, the p-endomorphisms, and extend the tvwB criterion to a general probability 
vector p (i.e. p not necessarily uniform). 

Chapters 2 and 3 give two proofs of one of the main results in this thesis. Theorem 2.4.1, 
which generalizes the main result in [H,R] to p-endomorphisms for a general probability 
vector p: 

Theorem 2.4.1. Let p = {pi,...,Ps) be a probability vector. Jf X = {X, B, iJ,,T) is a 
p-endomorphism, then X is tvwB if and only if X = B^{p). 

In Chapter 2, we give a proof which follows the classical argument used by Ornstein in his 
isomorphism theorem. The proof is based on a joint paper by del Junco and me [J, J] which 
treats the case when p is a uniform probability vector. In particular, it does not require the 
machinery of joinings introduced in [H,R]. The main additional ingredient which enables us 
to extend the proof in [J, J] is proposition 2.1.3, which essentially says that under certain 
factor maps (tree-adapted factor maps) between p-endomorphisms, conditional probabilities 
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of inverse images are preserved. In §2.5, we give an entirely different and quite elementary 
proof of Theorem 2.4.1 in the special case when the components of p are pairwise distinct. 

In Chapter 3, we will mirror the proof of the main result in [H,R] and extend their notion 
of one-sided joinings to give a proof of Theorem 2.4.1. The new ingredient needed is an 
additional condition in the definition of one-sided joinings which is trivial in the case of a 
uniform probability vector. We will also show that the joinings proof implies that there 
are uncountably many automorphisms of -B'*"(p), unless the components of p are pairwise 
distinct (in which case the identity is the only one). 

Chapter 4 illustrates some examples of tvwB p-endomorphisms such as certain classes of 
one-sided Markov shifts and certain extensions of one-sided Bernoulli shifts. It follows that 
these are all one-sided Bernoulli by Theorem 2.4.1. 

Chapter 5 proves an isomorphism theorem for tvwB finite group extensions of one-sided 
Bernoulli shifts. The main result is Theorem 5.1.1, which in view of Theorem 2.4.1 reduces 
to the following result which we paraphrase as follows: 

Theorem 5.1.1'. For any two tvwB finite group extensions of tlie one-sided sliift -B+(p), 
tliere exists an isomorphism which preserves the BernouUi factor algebra and maps fibres 
over points in the factor to other such hbres by group rotations. 

1.3. The Tree Very Weak Bernoulli Condition 

Suppose that {X, B, /i) is a nonatomic Lebesgue space and T X ^ X is measure- 
preserving. Fix a probability vector p with finitely many components. Let |p| denote the 
number of components of p. A m.p.s. X = {X, B, ji, T) is a p-endomorphism if T is |p| to 
1 a.e., the conditional probabilities of the |p| inverse images of x are the components of p 
for a. a. x, and the entropy of X is /i(p). Throughout this thesis, we will let End{p) denote 
the collection of p-endomorphisms. 

The standard element in End{p) is the one-sided Bernoulli shift i?+(p). Let us recall its 
definition here. For the finite set / = {1, . . . , |p|}, define a measure m on / by m(j) = Pj. 
Consider the product space I^* with the Borel sigma-algebra B, product measure m"^* and 
shift transformation a. We then define -B+(p) to be the m.p.s. {I^' , B, m^' , a). (The entropy 
condition in the definition of p-endomorphism is thus a natural one to make since S+(p) has 
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entropy /i(p).) 

In [H,R], Hoffman and Rudolpfi introduced a condition called tree very weak Bernoulli 
(tvwB) which they proved to be necessary and sufficient for a uniformly p-to-one en- 
domorphism, i.e. a p-endomorphism with p = (1/p, . . . , l/p), to be isomorphic to the 
one-sided Bernoulli shift on p symbols with equal weights. We will now extend the various 
definitions in [H,R] in order to handle the case that the components of the probability vector 
are not all equal. 

For a probability vector p, we define the p-tree to be the set, denoted T, consisting of 
all finite sequences (including the empty sequence) of integers in {1, . . . , |p|}. We define a 
node to be an element of the p-tree. The length of a node v, denoted \v\, is the number 
of integers in the sequence v (thus, the empty sequence has length zero). We will refer to 
the empty sequence as the root node and denote it by 0. Given any two nodes u and 
we define the node uv by concatenating the sequence u to the left of the sequence v. We 
will use T' to denote the set T\0. Define the map u : T' ^ T by setting a{v) to be the 
sequence obtained by deleting the leftmost symbol in the sequence v. Note that if = 1, 
then a{v) = $. Moreover, o" is a |p| to 1 surjection. It is helpful to picture a p-trcc as a 
tree in the graph-theoretic sense with vertices corresponding to the nodes and with an edge 
between u and (7{u) for each u ^ ^. The picture we get is the usual |p|-ary tree. In view 
of this pictorial representation, we refer to the set {uv \ < \u\ < N} as the subtree of 
height N rooted at v and the set {uv \ \u\ > 0} as the subtree rooted at v. 

To each node v in T, we assign a weight, denoted Wy, as follows. For the root node 0, 
we set W0 = 1. For any other node v = (ai, . . . , aj), set Wy = Yil^iPai- We define a tree 
automorphism to be a bijection A -. T ^ T such that A o a{v) = a o A{v) and Wy = Wav 
for V eT' (i.e. A preserves the tree structure and weights). Note that this implies ^4(0) = 0. 
It is obvious that the set A of tree automorphisms forms a group under composition. For 
G N, let 7^ C T denote the set of nodes of length < N and let 7^ denote the set Tj^^J). 
Let An be the subgroup of bijections of Tv which is ^|rjv- 

Given a fixed compact metric space {R, d), let us say that a i?-tree name is a function 
g : T' ^ R, and for each e N, a R,N-tree name is a function g' : ^ R. As in [H,R], 
we define a distance function, t^, on the space of R,N-ticee names as follows: if /i : 7^ — > i? 
and h' -.Tj^f^ R, let 
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t^ih, h') = ^ inf V d{h{v), h\Av))w,. 

Q<\v\<N 

Note that t^r is not a metric but it does satisfy the triangle inequahty. We say that two 
R,N-ticee names h and h' are the same up to tree automorphism if there exists some 
tree automorphism A e An such that h{v) = h'{Av) for all v. Note that ijv(^, h') = if and 
only if h and h' are the same up to tree automorphism. We shall let it!^"" denote the set 
of R,N-ticee names and R^^ denote the equivalence classes of R^N-tree names modulo tree 
automorphism. 

Suppose that X {X, B, ^,T) G End{p). We wish to construct i?-tree names for each 
point in X. To do this, fix a measurable partition : X {1, . . . , |p|} such that for a.a. 
X, the |p| inverse images of x have distinct Kj^ values and the conditional probability of the 
inverse image x' of x with Kx{x') = j is pj. We will refer to as a tree peirtition of X. 
Note that unless the components of p are pairwise distinct, there will in general be many 
distinct tree partitions. 

We now use to define a collection of partial inverses Ty for each node v e T'. For 
each node v of length one (i.e., v G {1, • • • , |p|}), define the map T„ : X — > X by setting T^x 
to be the inverse image x' of x with K^ix') = v. We may then extend the definition of 
to an arbitrary node f G T' as follows: if f = (ai, . . . , a„), set Ty{x) = Ta^{Ta^ . . . (Ta^x) . . .). 
Note that is injective and v i— > TyX maps T' to {T~^x | j > 0} for a.a. x. If y E T~^x 
for some j G N, then there exists a unique v of length j such that y = TyX, and w„ is the 
conditional probability of the preimage y of x under the map . 

Suppose we have some function g : X ^ R. Using the tree partition Kj^, we may now 
associate to a.a. x in X the i?-tree name : T' — > by setting t^{v) = g{TyX). We 
shall refer to the it!-tree name as the g-tree name of x and the restriction r^l-r' as the 
g-N-tree name of x. For any function f : X ^ R and n G N, define /""^ : X — > i?""^ by 
sending x to its /-n-tree name and /"^ : X — > R"-^ by sending x to the equivalence class in 
i?"^ containing its /-n-tree name. 

Note that for any g : X ^ R, the 7?-tree name rj' depends on the choice of the tree 
partition Xx- However, the tvwB condition, which we now define for p-endomorphisms, is 
not affected by the choice (since for any g : X ^ R, different choices yield the same g-N-tiee 
name of x up to tree automorphism) . Note that this definition is essentially the same as the 
definition in [H,R]. The only difference is that our definition of i„ is shghtly different as the 
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group of tree automorphisms is more restrictive in the case of a general probabihty vector p. 

Definition 1.3.1. Let X = {X,B, fi.T) E End{p) and g -. X ^ R for a compact metric 
space {R,d). We say that C^,g) is tree very weak Bernoulli (tYwB) if for each e > 0, 
there exists some N such that, whenever n > N, we have some set G Q X of measure at 
least 1—e with in{T^, t^) < £ for all x and y in G. 

We shall say that the a m.p.s. X e End{p) is tvwB if for all measurable functions 
g : X ^ R ior a compact metric space R, (X, g) is tvwB. It is immediate that the tvwB 
property is preserved under isomorphism. 
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Chapter 2: 

TvwB p-Endomorphisms on Lebesgue 
Spaces 

The goal in this chapter is to prove the following theorem stated in §2.4, which generalizes 
the main result in [H,R] to an arbitrary probability vector. 

Theorem 2.4.1. Let p = (pi, . . . be a probability vector. If X = {X, B, ji, T) e End{p), 
then X is tvwB if and only if X ^ -B+(p). 

This chapter is organized into several sections. The first three sections establish some 
basic tools that will be necessary in the proof of Theorem 2.4.1. In §2.1, we define the concept 
of a tree-adapted factor map and establish a crucial property shared by these factor maps. 
§2.2 proves the tree ergodic theorem, which may be considered as a "backward" version of 
Birkhoff's ergodic theorem. §2.3 establishes the tree Rokhlin lemma and its strong form, 
which are analogues of the standard Rokhlin lemma for invertible transformations. §2.4 
contains the proof of the main theorem. The proof in §2.4 does not require the machinery of 
one-sided joinings introduced in [H,R] and is patterned on the proof of Theorem 2.4.1 given 
in a joint paper by del Junco and me [J, J] in the case that p is a uniform probability vector. 
§2.5 gives a completely different, and quite simple, proof of Theorem 2.4.1 in the special case 
that p has pairwise distinct components. 
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Throughout this chapter, p will denote the probability vector (pi, . . . ,Ps). 

2.1. Tree- Adapted Factors 

Definition 2.1.1. Let X = {X, B, T) e End{p), then a m.p.s. Y = [Y, C, u, S) is a tree- 
adapted factor of X if there is a factor map : X — > Y such that for a.a.x in X, (j) gives 
a bijection of the inverse images of x and those of (t){x). We will refer to the factor map 
: X ^ Y as a tree-adapted factor map. 

Note that this definition is more restrictive than in [H,R], which requires only that (j) maps 
T~^x one-to-one into S~^4>{x). Nonetheless, this definition will sufiice since the factor maps 
which we construct in the proof of the isomorphism theorem (in particular, in Proposition 
2.4.9) is tree-adapted in our sense. 

The following proposition shows that a tree- adapted factor of a p-endomorphism is itself 
a p-endomorphism. It is the extension of Lemma 2.3 in [H,R] to p-endomorphisms. For a 
p-endomorphism X = (X, B, fi, T) , define the function px : X ^(0,1) by setting py^{x) to be 
the conditional probability of the preimage x of Tx. We shall refer to px as the p-function 
on X. 

Proposition 2.1.2. A tree-adapted factor of a p-endomorphism is a p-endomorphism. 

Proof: Let X = {X,B, iJ,,T) e End{p). Suppose Y = (F, C, z/, 5) is a tree-adapted factor 
of X and cf) -. X ^ Y is a, tree-adapted factor map. By definition, S is also |p|-to-one a.e. 
Fix a point y E Y, and suppose yi, . . . , are the inverse images of y. For each x e (f)~^y, 
the conditional measure on T~^x given x pushes forward via to a measure on S~^y. The 
conditional measure on S~^y given y is just an average of these image measures on S~^y over 
all X e (p'^y- As is tree-adapted and X e End{p), these image measures assign the s inverse 
images of y with measures equal to the components of p. Thus, (Py(1/i)), • • • iV^iUs)) is an 
average of probability vectors (Pa^(i), . . . ,Pcr{s)) for a permutation a : {1, . . . , s} {1, . . . , s} . 
The fact that the function h{t) = — tlog2t is strictly concave implies that the entropy of the 
probability vector (PyIj/i)), ■ ■ • ,Pyiys)) is at least h{p) with equality holding if and only if this 
probability vector is some permutation of p. The fact that the entropy of X is h{p) implies 
that equahty must hold for a.a. y and so the entropy of Y is h{p). But this is precisely 
saying that Y = (Y, C, i/, S) is a p-endomorphism. ■ 
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The following proposition gives an important property shared by tree-adapted factor 
maps; namely, they preserve the p-function. 

Proposition 2.1.3. Suppose X = {X, B, fi,T) G End{p). If : X — > Y is a tree-adapted 
factor map, then for a.a.x in X, pji_{x) = p^{4>{x)). 

Proof: By proposition 2.1.2, Y = {Y,C,u,S) e End{p). Let y E Y and let /ly be the 
conditional measure on (f)~^{y) given y. With no loss of generality, suppose pi > ... > Ps- 
Let yi, . . . ,ys be the inverse images of y such that p^{yj) = Pj- Since is tree-adapted, 
for each x e (f)~^{y), let Xy. be the unique inverse image of x such that 4>{Xy.) = y^. Now, 
Pi = PYiVi) is an average of px(a^yi) over x e (f)~^{y). As px(a^yi) < Pi, it follows that 
Py(1/i) = l^i = Px(a^yi) for fiy-a.a. x. Next, note that since Xy^ ^ Xy^ by trcc-adaptedness, 
Py^iXy^) < P2 for fiy-a.a. x. Hence, Py(2/2) = P2 = Py.{Xy^) for /^^-a.a. x. Inductively, we sec 
that p^iyj) = py^{Xy.) for fiy-a.a.x for each 1 < j < s. As 7/ is arbitrary, there is a set G of 
full measure in X such that for each x e G, if x' e T'~^a;, then p^{x') = p^{(t){x')). Then 
T~^G is also a set of full measure satisfying the statement of the proposition. ■ 

If X = {X,B,ix,T) e End{p) and x e X, define the px-name of x to be the sequence 
(p:x.{x),p^^{Tx), . . .). Proposition 2.1.3 implies that if : X ^ Y is a tree-adapted factor 
map, then the px-name of x equals the pv-name of (f){x) for a. a. x. 

The fact that the p-function is preserved under tree-adapted factor maps leads to the 
following proposition which will be used in the proof of Theorem 2.4.1. 

Proposition 2.1.4. Suppose X = {X,B,iJ,,T) e End{p) and Y = {Y,C,t^,S) is a tree- 
adapted factor of X. Consider a tree-adapted factor map : X — > Y. For any measurable 
function f :Y ^ R and any m e N, we iiave for a.a.x, 

(/o0r^(x) = r^(0(x)). 

Proof: For x E X, define the map tt^. : 7^ ^ TJ^ by setting 7i,^{v) = -u if <^(T„,t) = Sud>{x) and 
1^1 = \u\. Note that this map is wcll-dcfincd and is a bijcction as (f) is trcc-adaptcd. Moreover, 
since is a factor map, it follows that ir^ o a = a o n^. By the remarks following proposition 
2.1.3, px-name of 2; = pv-name of ^(z) for a.a. z and thus tt^ preserves the weights of nodes 
so that TT^ defines a tree automorphism. It follows that for every v e T^, 

U 
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2.2. The Tree Ergodic Theorem 



The goal in this section is to demonstrate a tree version of the ergodic theorem (propo- 
sition 2.2.3). To begin, notice that the following is a trivial consequence of the definition of 
tvwB of a m.p.s. X e End{p). 

Proposition 2.2.1. Suppose X e End{p) is tvwB, then X is ergodic. 

Proof: Consider the characteristic function g = Xg for an invariant set G. Then (X, Xg) 
can only be tvwB if G has measure zero or one. ■ 

For a m.p.s. X = {X, B, /i, T) and a real- valued function g on X and a; e X, set 



The following proposition is proved using Birkhoff 's ergodic theorem. 

Proposition 2.2.2. Suppose X = {X, B, fi,T) G End{p) is ergodic. Let B C X. Given 
s > 0, there exists N & N such that for all n > N, we have a set G of measure > 1 — s such 
that for each x E G, 



Since X is ergodic, we have for a.a.x, An{xB){x) — > l~i{B) as n — >• cx). In particular, for 6 > 0, 
there exists some N such that for all n > N and for all y in a set G of measure > 1 — S, 



A^{g){x) = -Y,9{T'x). 



i=0 




Proof: Note that 




(1) 



A^{XB)iy) - f^iB)\ < 5. 



(2) 



Hence, if /j.^ is the conditional measure on T given x, then 




(3) 



+ I w^A„{xB){TyX) - w^ii{B)\ 




<5 + 2V5. 
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Since S can be chosen to be arbitrarily small, the result now follows from (1). ■ 



Let X = {X, B, n, T) e End{p) and consider a partition P : X — > C for some finite set 
C. Given M e N and x E X, define a measure 9^^m,p on C by 

Ox,M,p{c) = Yl 

{veTl^\P{T,x)=c} 

for each c E C. We say that x E X is £,M-generic for P if 

^|^.,M,p(c)-M^"'(c))| <£■ 

The following is the Tree Ergodic Theorem and follows easily from proposition 2.2.2. 

Proposition 2.2.3 (Tree Ergodic Theorem). Let X = {X,B,ii,T) he an ergodic p- 
endomorphism and P : X ^ C be a finite partition. For every e > 0, we have for all large 
M, a set G of measure 1 — e such that for each x E G, x is e,M-generic for P. 

2.3. The Tree Rokhlin Lemma 

Let X = {X,B, fi.T) be an ergodic m.p.s. A £-tree Rokhlin tower of height + 1 
in X is a collection of pairwise disjoint measurable sets {Bq, Bi, . . . , B^} in X such that 
Bj = T~^Bo and whose union Uf^^Bi has measure > 1 — e. If M is such a tree Rokhlin 
tower, we will let UM denote the union of the sets in M. We will refer to the set Bq 
as the base of the tower and the set Bj = T~^Bq as the j-th level of the tower. If 
B' C B, we will refer to the union U^iT~'5' as the column of the tower over B'. We 
now establish the analogues for p-endomorphisms of the standard Rokhlin lemma and the 
Strong Rokhlin lemma. Their proofs follow along the same lines as propositions 5.2 and 5.3 
in [H,R], although their results were only stated for uniformly p-to-1 endomorphisms. For 
completeness, we include the proofs below. 

Proposition 2.3.1 (Tree Rokhlin Lemma). Let X = (X, fi, /x,T) be ergodic, then for 
each e > and N eN, there exists a e-tree Rokhlin tower of height N+1 in X. 

Proof: For a set D C X of positive measure, define the set 

B = {xeX \ min(i >0\T'xeD) = mod{N + 1)}\D 
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Note that B is well-defined since by ergodicity, for a.a. x in X, T^x e D for some i e N. 
It suffices to show that B fl T~^B = 0, for each 1 < j < N. Suppose on the contrary, 
X e -B n T-^B for some 1 < j < A^. Then there exists y e B such that y = T^x. By the 
definition of B, the set {x, Tx, . . . , T^~^x, T^x = y} is disjoint from D, hence we have 

min(t >Q\T'xeD)= mm{t > \ T'y e D) + j. 

Since x E B and y E B, taking mod(A'"+l) of both sides forces j = 0, which is a contradiction. 
Hence B n T-^B = 0. Let M = {T-'B \ 0<i<N}. 

Now, as X is ergodic, //(-B) >1/{N + 1)— jJi{D) and so 

ti{Ut,T-'B) = {N + 1MB) >1-{N + IMD). 

Choosing D such that //(-D) < e/{N + 1) shows that the Rokhhn tower M has the desired 
property. ■ 

Before we prove the Strong Tree Rokhlin lemma, we introduce some notations which 
we will use in the rest of this thesis concerning distributions of partitions on a probability 
measure space. For a finite set C, define the total variation norm on the probability 
measures on C by 

\p-9\=Y,\Pic)-9{c)\. 

We write p'^ 6 if \p — 6\ < e. Suppose (fi, B, A) is a probability measure space. For a subset 
G C Q of positive measure and a measurable finite partition P : Q ^ C , let P\G denote the 
restriction of P to G. Let be a probability measure on B defined by \g{B) = X{Br\G)/ X{G) 
for B C fl. Obviously, X = Xq if G = il. For a partition Q -. G ^ G, let distxciQ) be the 
probability measure on G defined by distxa{Q){c) = Xg{Q~^{c)) for each c E G. We will refer 
to the sets Q~^{c) C G, c e C, as the atoms of Q. Where there is no ambiguity as to the 
measure on the domain of the partition, we will generally omit the subscript and just write 
dist{Q). For a measure preserving transformation T on (Q, B, X) and a partition P : Q — > C, 
we define the P-name of x to he the infinite sequence (P(x), P{Tx), . . .) and the P-n-name 
of X to be the finite sequence (P(a;), . . . , P(T"~^x)). 

If P : Q ^ G and Q -. fl ^ G' are partitions into finite sets G and G', we say that Q < P 
if there exists a map tt : C — > C" such that Q = tt o P, i.e. knowing P{x) determines Q{x). 

We will need the following technical lemma that guarantees the measurability of the 
atoms of the various partitions that will be constructed in the proof of the Strong Tree 
Rokhlin lemma and in the proof of Theorem 2.4.1. 
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Lemma 2.3.2. Suppose X = {X, B, ji, T) e End{p). Consider a measurable finite partition 
P : X ^ C. For any C ,n-tree name t, the set {x & X \ = t on 7^'} is measurable. 

Proof: Note that r induces a subset in C" whose elements are indexed by the nodes of 
length n. In fact, if \v\ = n, we may associate the sequence f{v) = {t{v), . . . ,T{a"'~^v)) e C" 
to V. Let C{v) denote the set {x E X \ P-n-name of a; is f{v)}. 

Let G = U|„|=„T^X n C{v), which is clearly measurable by the definition of the partial 
inverses in §1.3. For x e X, let /i^ be the conditional measure on T'~"x given x. Notice 
that the function g -. X ^ [0,1] defined by g{x) = ii^iG) is measurable. In particular, the set 
{x E X \ g{x) = 1} is measurable. However, this is precisely the set {x E X \ = t on 7^'}. 
■ 

Proposition 2.3.3 (Strong Tree Rokhlin lemma). Let X G End{p) be ergodic. Con- 
sider a measurable partition P -. X ^ C. Then for each e > and iV G N, there ex- 
ists a e-tree Rokhlin tower of height N-hl in X whose base B is independent of P, i.e. 
dist{P\B) = dist{P). 

Proof: Let N' be an integer and e' > 0, both to be specified later. Use the Tree Rokhlin 
lemma to build a £'/2-tree Rokhlin tower M' of height N' + 1 with base D. Let P be the 
partition P'^''^\D. By Lemma 2.3.2, P defines a measurable partition of D. For each atom 
a in P, divide a into N + 1 measurable disjoint sets P^, . . . , P"^ of equal measure. Consider 
the set 

^' = U U U ^~'(^"- 

P 0<i<N j = i "lod (iv + i) 

0<j<N' ~N 

Notice that B' is disjoint from P-^B' for all < j < N. Let M = {p-'B' | < i < iV}. As 
M contains the {N + 1)^* to {N' — iV)*'* levels of M', we may choose N' large enough and e' 
small enough such that the union of the sets in M has measure > 1 — e/2. 
From our definition of B', we have 

N'-N 

dist{P\B') = dist{P\ [j p-'D). 

i=l 

For any 77 > 0, by decreasing e' and increasing N' if necessary, we have 

\dist{P\B') - dist{P)\ < rj. 

If 77 > is small enough, we may remove at most e/2 fraction of B' to arrive at a set 
B such that dist{P\B) = dist{P) and B forms the base of a £-tree Rokhhn tower of height 
AT + l. ■ 
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2.4. Proof of the Isomorphism Theorem 



Unless otherwise specified, all m.p.s. in this section are p-endomorphisms. The purpose 
of this section is to prove the following theorem. 

Theorem 2.4.1. Let X = {X, B, /x, T) e End{p), then X is tvwB if and only if X ^ B+{p). 

Our first goal is to prove that -B^(p) is tvwB, which immediately proves half of Theorem 
2.4.1. To do this, wc will prove an extension of Lemma 2.5 in [H,R] to End{p), which shows 
that any tree-adapted factor of a tvwB p-endomorphism remains tvwB. Before we prove this 
(Proposition 2.4.3), a technical lemma is in order. 

Lemma 2.4.2. Given a m.p.s. X = {X, B, ji, T) , for each n E S > and B C X such 
that ijl{B) > 1 — S, there exists a set G of measure > 1 — V6 such that for each x E G, 

- w,Xb{T^x) > 1-VS. 



n 

0<|ti|<ra 



Proof: For each /c > 0, let 

Clearly, we have 
Hence, 



{x,B) = ^ Wy. 



Ijl{B) = / Wk{x, B) dii{x). 



/" 1 " 

H{B) = / - X] Wi{x, B) dn{x) >1-S. 

i=l 



So, 



1 

-y^Wi{x,B) >1-Vd 
n ^-^ 

i=l 

except on a set of measure at most \f5. We finish the result by observing that 

- y Wi{x,B) = - y w^XsiTyX). 



n — ' n 

i=l 0<\v\<n 



Suppose wc have a measurable function : X — > for a compact metric space {R,d). 
We say that g is tree-adapted if for a.a.x, g assigns different values to the inverse images 

oo 

of X. If V is the Borel cr-algebra of {R, d), we say that g is generating if ;B = V T"*gf"^(P). 
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It is clear that a generating g is necessarily tree-adapted. We now prove the extension of 
Lemma 2.5 in [H,R]. The proof follows along similar lines, though we will also need the fact 
that tree adapted maps preserve the p-function (Proposition 2.1.3). 

Proposition 2.4.3. Let X = {X,B,fi,T) e Endip) and let Y = {Y,C,u,S) be a tree- 
adapted factor of X. Suppose the function h : X ^ R is generating. If (X, h) is tvwB, then 
for any function g :Y ^ R' into a compact metric space {R\ d'), (Y, g) is tvwB. 

Proof: By normalizing d and d\ we may assume that the metric spaces R and R' have unit 

diameter. Let (j) : X ^ Y he a, tree-adapted factor map. Now, h is generating and g o (j) \s 
i3-measurable. Hence, for each £ > 0, there exist some sGN, 0<5<£ and a set of 
measure > 1 — e such that whenever x and x' are in and d{h{T'-x), h{T'-x')) < S for all 
< i < s, then d'{g{(f){x)), g{(f){x')) < e. 

Since (X, h) is tvwB, for a sufficiently large N, there exists a set G C X of measure 
> 1 — 5^ such that whenever z and z' are in G, 

i^iT^,T^,)<5'. (1) 

Moreover, by Lemma 2.4.2, we have a set G' C X of measure > 1 — ^/e such that for each 
xeG' 

^ WvXgATvX) > 1 - V^. (2) 

0<|t>|<n 

Fix a pair of points x and x' in G HG'. Using (1), we have some tree automorphism A & A 
such that 

J2 wJ{h{T,x),h{TAvx'))<5\ (3) 

0<|j;|<JV 

Let = {veT' \ d{h{T\T^x)), h{T\TAvX'))) > 5 for some < i < s}. Then by (3), 

^ J2 w,<{s + l)S + s/N. (4) 

veDi+ 

Let V = {v e T' \ V ^ DI+,T^x e G^,TavX' e GJ. Then notice that whenever v eV,hy 
our choice of S, 

d'{g{<P{T,x)),g{<P{TAy)))<e. (5) 

Note that the tree automorphism A corresponds to a bijection of the trees of inverse images 
of X and x'. This in turn pushes down via to a bijection of the trees of inverse images 
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of and (f){x'). Since is tree-adapted and hence preserves the p-function (proposition 
2.1.3), this last bijection in turn yields a tree automorphism A'. By (2), (4) and (5), we have 
1 . 1 

— wJ'{g{S^(p{x)),g{SA'v(p{x'))) = — ^vd\g{(p{T^x)),g{(t){TAyx'))) 



Q<\v\<N 0<|t)|<iV 



0<|u|<iV 0<|u|<iV 0<|»|<iV 



+ ^ E Wvd\g{(t>{T,x)), g{(t>{TA.x'))) 

vev 

< ^/e + y/e + {s + 1)5 + s/N + e < ^^/e 



for all sufficiently large N and small 5. 

As //(C n G) > 1 — 2-y/i, if £ is sufficiently small, then there will be a large set C y 
such that each y e H is the image of some point in G' (IG. Thus, the result follows from 
the preceding calculation. ■ 

Proposition 2.4.3 implies that if (X, /) is tvwB for some generating /, then (X, g) is also 
tvwB for any compact-valued function g. We thus see that the m.p.s. X is tvwB if and only 
if (X, /) is tvwB for some generating /. Hence, -B+(p) is tvwB since its standard generator 
(zero coordinate partition) yields the same tree name for all points in S+(p) by choosing the 
tree partition Kb+(p) to be the zero coordinate partition. We have thus proved the easier 
half of Theorem 2.4.1, which we record as the proposition below. 

Proposition 2.4.4. Suppose X = (X, B, /x, T) e End{p). If X ^ B+{p), then X is tvwB. U 

Our goal now is to show that if a p-endomorphism is tvwB, then it is one-sided Bernoulli. 
It will be convenient for our presentation to work with functions defined on p-endomorphisms 
with range in the metric space [0,1) (with the absolute value metric \x — y\). As we will see 
below, this gives us a fairly natural way to partition the range space and to define tree name 
distributions induced by a function on a p-endomorphism. For a m.p.s. Y = {Y, C, S) and 
two functions g, h : Y ^ [0, 1), let \g — h\ = \\g — h\\^i. 

For any function g : Y ^ [0,1) and G N, we will construct a "discretize" version of g 
which closely approximates it. Specifically, for each A^, let Pn be the set of dyadic intervals 
{[0, 1/2"^), . . . , [(2^^- 1)/2'^, 1)} of length 2-"^. Let be the set of midpoints of the intervals 
in Pn, i.e. = {{2t + l)/2^+\ 0<t< 2^}. Define the function g^ -.Y ^ Dj^hy setting 



2t + l ^ t t + 1 ^ 
9n{x) = ^;j^ ifxe [^,^), 0<^ <2^ 
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Clearly, \gN — g\ < 1/2^. Observe that Qn assumes values in the finite set so that we 
may regard as a partition of Y. 

Given a function g : Y ^ [0,1) and positive integers m and n, recall from §1.3 that 
we have the partition g^^ : Y — > [D^Y'' by mapping a point y to its ^f^-n-tree name and 
the partition g^ : Y (D^)"^ by mapping a point y to the equivalence class in (Z}„j)"^ 
containing g'^iy)- Let us also define the partition g"^^ : Y (-Dm)" by mapping a point y to 
its (7m-n-name. Observe that g'^J < g''^T, g27 < g"^ and g'^t < g^^ if n' < m' and n < m. 
The following proposition shows that closeness in tree name distributions implies closeness 
in forward name distributions. 

Proposition 2.4.5. Let Y = {Y,C,iy,S) e End{p). For n, m E N and any functions 

g,h:Y ^[0,l),if 

dtst{g:^)^dtst{h:^), 

then 

dist{g':^)^dist{hT). 

Proof: For each element C, G (-Dn)"* and a e (ZJ^)"*^, and for any representative Pea, let 
w{(, a) be the total weights of nodes v of length m such that {/3{v), P{crv), . . . , P{a'^~^{v)) = (. 
(This is clearly independent of the representative chosen.) Then 

J2 \dist{g::+){C)-dist{h::+){C)\< J2 E ^^(c,«)M^s^(5^'')(«)-c^^s^(/i^^)(«)| 

= J2 w{C,a)\dtst{g":^){a)-dist{h::^)ia)\ 
= \dtst(g':^){a) - distih':''){a)\ < e. 

m 

Let X = {X, B, ji, T) and Y = (Y, C, v, S) be p-endomorphisms. Suppose : X — > [0, 1) 
and /i : y — > [0, 1), we wish to define a t distance between the processes (X, g^) and (Y, /i) 
analogous to the d distance in Ornstein's Theory. To do this, we first let 

i„((X,^),(Y,/i)) = j UTlT^^)dix{x)dv{y). 

Then define 

?((X, g), (Y, h)) = liminf i„((X, g), (Y, h)). 
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We remark that this is not the definition in [H,R] in which they define t„((X, g), (Y, h)) 
as an infimum of the integral of i„(Tf,T^) over the set of one-sided couphngs of X and Y 
(see Chapter 3). Here, we only define it with respect to the product measure. Note that 
closeness in i essentially means that for some large n, t„(rf , r^) is small for any x in a large 
set in X and for any y in a large set in Y. 

The following proposition shows that if X is tvwB, then t((X, g), (Y, h)) is small provided 
that dist{g'^) is sufficiently close to distQi'^) for some n. The argument follows along the 
same lines as Lemma 4.4 in [H,R], though the notations found there need to be modified for 
our situation, since we are considering p-endomorphisms so that the weights of the nodes 
within a given level are not uniform. 

Proposition 2.4.6. Suppose X is tvwB and g -. X ^ [0, 1). For all e > 0, there exist 5 > 
and G N with the following property: for any Y G End{p) and any function h : Y ^ [0, 1), 
ifdist{g^^) dist{h%^), then t((X, g), (Y, h)) < e. 

Proof: Let 77 > 0, to be specified later. Since X is tvwB, we may choose N large enough 
such that there exists a set G of measure > 1 — 77 such that iN{T^, t^,) < rj whenever z and 
z' are in G. Clearly, we may assume that 1/2^ < 77. Fix x' G G. If dist{g^'^) dist{h%^), 
then we have a set if in Y of measure at least 1 — 277 such that if y G if, then 

^iv(T.^T^'')<77 + (l/2^)< 277. 

Let r' = r^,. Create a tree name r : T' — >■ [0,1) by tiling with copies of r' as follows: If 
< < and \v'\ = kN for some integer k > 0, we define r by setting r{vv') = t'{v). 
We would like to show that r is close to and for large sets in X and in Y in ikN for 
all A; G N. Roughly speaking, the idea is to think of each tree of height kN as consisting of 
subtrees of height N and to realize that the tvwB condition guarantees that the tree names 
of these subtrees are close to r' in tjv on average. 

We will now define inductively, A^-levels at a time, a tree automorphism A which makes 
ikNiT^,T) small on a large set in X for all k. For x e X, let he a tree automorphism 
which realizes the minimum in the definition of ij^{T^, r). For < \v\ < N , set A{v) = A^{v). 
Inductively, for each v & T such that jN < \v\ < {j + 1)N, j > 1, write v = v'u for unique 
nodes v' and u such that \u\ = jN and < < A^, and define A{v) = Arp^^{v')A{u). 

For j >0 and x e X, let Sj{x) = {v e T \ \v\ = jN and T^x G G}. Let w{Sj{x)) denote 
the sum ^y^Sj{x)'^^- Then by Lemma 2.4.2, for all k > 0, there exists a set Gk Q X oi 
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measure > 1 — ^/jy such that 

k-l 



^J]w(S',(a;))>l-V^ 

3=0 

for each x & G^- By our construction of the tree automorphism A, and r' are matched to 
within rj in In whenever y = T^x for v e Sj{x). For each x e Gfc, by calculating ikN{T^, t) as 
an average of iN{T^, t') over all y G T^'^x, < j < k — 1, we have 



j=0 \v\=jN 
^ fc-1 

3=0 veSj{x) v^Sj{x) 

^ fe-1 

iVwC-^iC^)) + N{1 - w{S^{x))) 

3=0 

fc-1 

<l-(l-V^)(l-r7). 

Hence for all but a set of measure 2^ in X, we have ik^iT^, r) < 2y^. 

By the same argument, for all but a set of measure in Y, we have ikN{Ty, r) < 4y^. 
Hence, by choosing r] sufficiently small, we have 

ikNir!^,ry)dfiix)diyiy) < J tkN{r^^,r) d^i{x)dv{y) + J i^N{r,Ty)dii{x)dv{y) 

< iy/rj + 807 < e 
Since k is arbitrary, the result now follows. ■ 

Let X = (X, B, fi, T) and consider a partition P : X — > C for some finite set C. Given 
M e N and a; e X, recall from §2.2 that we defined a measure 9x,m,p on C by 

dx,M,p{c) = 

{«erj(^|p(T„x)=c} 

for each c G C. Also recall that a; G X is £,M-generic for P if 

\Ox,M,p - dist{P)\ < £. 

Note that if X is ergodic, the Tree Ergodic Theorem implies that for all sufficiently large M, 
there is a large set G C X such that all points x e G are £,M-generic for P. 
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Combined with proposition 2.4.6, the next proposition (the perturbation lemma), shows 
that if X is tvwB and ?((X, /), (Y, g^)) < £, then we only need to perturb g slightly (depending 
on e) to get a function h onY such that the processes (X, /) and (Y, h) are as close in t as 
we like. This will be the key observation in the proof of Sinai's Theorem (Proposition 2.4.9). 

For the proof of the perturbation lemma (Proposition 2.4.7) and the copying lemma 
(Proposition 2.4.12), we will be constructing a function on a chosen tree Rokhlin tower M 
using one or more chosen tree names in a way analogous to "painting" columns of a Rokhlin 
tower with a name as in the proof of Ornstein's theorem. Since this construction is central 
to the proofs, let us now define it exphcitly. Suppose Y = {Y,C,v,S) e End{p), e N 
and A e An- Given a tree name t : ^ R and a point y & Y, we may define a function 
h : {S-^y \ l<t<N}^R such that h{S^y) = t{Av) for ell v E Tj^^. We will refer to h as 
a laying of r on {S~^y \ 1 < t < N} via A. Doing this for all y in the base S of a tree 
Rokhhn tower M of height N + 1 ior fixed A e An and tree name r defines a function on 
[JM\B. 

Proposition 2.4.7 (Perturbation Lemma). Let X = {X, B, T) and Y = {Y, C, S) be 

ergodic. Suppose S > and f : X ^ [0, 1). For any e > 0, N e N and for any function 
g : Y ^ [0, 1) such that ?((X, /), (Y, g)) < S'^ , there exists some function /i : y — > [0, J) such 
that \g-h\< 85 and dist{h^^) ^ distif^^). 

Proof: There is no loss of generahty in assuming that N is large enough such that 1/2^ < 5'^. 
Let rj < 6 he specified later. Choose M e N and a set G C X of measure > l — rj such that 
the following conditions hold: 

a) N/M < 77/2 

b) X is ?7,(M — A^)-generic for f^^ for all x e G 

c) JiM{r^,T^y)di2{x)du{y)<6\ 

The Tree Ergodic theorem and the hypothesis show that such an integer M and set G exist. 
From c), we have 



except on a set C X of measure < 5^. Choose some point x' G G\H. It follows that on 
a set G' C Y oi measure > 1 — 5, we have iM(Ti/,Tp < 5 for y e G'. By the Strong Tree 
Rokhlin Lemma, we may build a r]/2-tree Rokhlin tower M of height M + 1 in Y with base 
B such that Ub{G') >1-5. 
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We now construct the required function h. For each y e Y, let Ay e Am be a tree 
automorphism which reahzes tM{Ty,T^,). For each atom a in the partition g^'^\B, we pick a 
representative point y{a) e a. We define h on UM\S by laying rf, on {S~*y \ 1 <t < N} 
via Ay(^a) for each point y & a and then for each atom a. Note that for a fixed node v and 
a fixed atom a in g^'^\B, h is constant on S^a, thus h defines a measurable function on 
UM\i?. We then extend h measurably to the rest of the space in any way we desire. 

For each atom a G g^''\B, let Ca be the column of the tower over a. Let us say that 
is a good column if a contains a point in G'. Now, ii y e G' B and y & a, then since 
1/2^ < S, we have 

^M^f^), riO < tM«(„), riO + 6 < tM{r^, t^) + 26< 36. (1) 
For each good column Ca, a simple calculation using (1) and our definition of h shows that 

J \gM{z)-h{z)\ dv{z) <35u{Ca). 

Co, 

Since UsiG') > 1 — 5, conditionally on B, the bases of the good columns is a set of measure 
> 1 - (5. Thus, 

i/({UC„ I Ca is good}) > (1 - 5)z/(Ufii5"*£) > 1 - 25. 

Hence, 

\g-h\<\g- gM\ + \gM - h\ 
< 1/2^ + 55 < 85. 

For the second part of the conclusion, fix an atom a e h^'^\B. Then there exists some 
tree automorphism A such that h^iSyy) = fNiTAv^') for all y e a and for all v e 7^. Hence, 
for each { e (Dn)^^, if fj^^{T,x') = ^ ior v e T^.^, then h^^{SA-^vy) = C ior all yea. Let 
^ = {v e "^M-N I fN^{TyX') = Since tree automorphisms preserve weights of nodes, we 
have 

dzstihn u s-^am = y 

^ ^ ' ^ (M - N)iy(a) 

= 6'x',M-iV,/J;fv(^). 

Thus, 

dist{h%^\ y S~^a) = 9^,^M-Nj^^- 

0<j<M-N 
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Obviously, this is true for each a e h^'^\B. Hence, (b) imphes that 

dist{h^^\ U S-'B)^dist{f^^). 

0<j<M-N 

Using (a) and the fact that the measure of UM is large, we also have 

dist{hr)'^dist{fr) 
for ri small enough. This completes the proof. ■ 

By imitating the proof of the previous proposition, but without the need to make h close 
to a predefined g, the following proposition is immediate. 

Proposition 2.4.8. Suppose X = {X, B, ji, T) and Y = (Y, C, v, S) are both ergodic p- 
endomorphisms. For any function f : X ^ [0, 1), and for all e and N, we have some function 
h:Y ^[0,1) such that dist{h^^) ~ dist{f^^). ■ 

For Y = (F, C,i/, 5') and a function : y ^ [0, 1), we have a map g^' Y — > [O,!)"^ 
defined by g^' (y) = {g{y), g{Sy), . . .). The measure u pushes forward via g^' to the shift 
invariant measure dist{Y,g) = v o {g^')~^ on [0,1)^ . We will refer to the map g^' as the 
g-name map. 

We will now prove the analogue of Sinai's theorem in Ornstein's theory for tvwB p- 
endomorphisms. In fact, we will need a slightly stronger version of it. 

Proposition 2.4.9 (Strong Sinai's Theorem). Supposed = {X, fx,T) is tvwB. Let 
f : X ^ [0, 1) be a generating function. Given e > 0, there exist S and N such that if 
Y = (y, C, S) is ergodic and if g : Y ^ [0, J) satisfies dist{g^^) ~ dist{f^^), then there 
exists a tree-adapted function /i : y — > [0, J) such that dist{Y, h) = dist(X., f) and \h — g\ < e. 

Proof: By Proposition 2.2.1, X is ergodic. The strategy is to apply Propositions 2.4.6 and 
2.4.7 repeatedly. Precisely, choose N^. y oo and e^. \ such that for each k, N^. and 
correspond to in Proposition 2.4.6 applied to (X, /). We show that if N = Ni and 

S = El, the result holds. 

To see this, notice that if 

dist{g^f)'r^dist{f^^J), 

we have by Proposition 2.4.6, 

t{{X,f),{Y,g))<{e/16)\ 
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Hence, by Proposition 2.4.7, we have some function on Y such that \g — < e/2 and 

Inductively, we obtain a sequence of functions g^''s such that \g^ — g'-'+^j < and 

dist{{g^f^;)'ridist{f^;^). 

Hence, the approach some function h pointwise a.e. Note that \g — h\ < e. 

To see that dist(Y, h) = dist(X., /), note that for any G N and any ^ > 0, we have by 
proposition 2.4.5, 

dzst{{g^)^+)^dzst{f^+) 
for all large j. Thus, for any given 7 > 0, we have 

dist{Y, g') ^ dist(X, f) 

in the i(;*-topology for all large j. Now, as g^ — > h a.e., we also have for large j, 

dist(Y, h) ~ dist(Y, g^) ~ dist(X, /). 

Since 7 is arbitrary, we have dist(Y, h) = dist(X., /). 

It remains to show that h is tree-adapted. By construction, for any G N and 7 > 0, 
dist{{g^)]^) dist{f]^) for all large j. As g^ h, it immediately follows that for any M G N, 
dist{h]^) = dist{f^). Since / is generating and hence tree-adapted, so is h. ■ 

The following is an immediate consequence of propositions 2.4.8 and 2.4.9. 

Corollary 2.4.10 (Sinai's Theorem). Suppose X = {X,B,ij.,T) is tvwB. Let f -. X ^ 
[0, 1) be a generating function. IfY = {Y, C, u, S) is crgodic, then there exists a tree-adapted 
function /i : y — > [0, J) such that dist{Y, h) = dist(K., /). ■ 

For a function gonY, consider the gi-name map g^' : Y — > [0,1)'^ defined previously. 
Similarly, if / is a function on X, consider the /-name map f^' : X [0,1)'^ . If dist(X., f) = 
dist(Y, g) for a generating / on X, then we have a factor map tt : Y X defined by 
= if^*)~^ ° 9^' ■ Note that tt is tree adapted if (7 is a tree-adapted function on Y. Thus, 
Sinai's Theorem implies that if X is tvwB, then it is a tree-adapted factor of Y. 

Before we establish the copying lemma. Proposition 2.4.12, we need the following result 
which shows that if dist{Y, g) = distQ^, f) for a generating / and a tree-adapted g as in 
Sinai's Theorem, the tree name distributions induced by g and / are the same. 
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Proposition 2.4.11. Suppose X and Y are in End{p). Let / : X — > [0, J) be a generating 
function. Suppose g : Y ^ [0, 1) is a tree-adapted function such that dist{Y, g) = dist(X., /). 
Then for allM eN, 

distigr) = dist{fr)- 

Proof: This follows directly from proposition 2.1.4 applied to the tree-adapted factor map 
TT : Y — > X defined by tt = {f^*)~^ o g^' and on noticing that gM = ° ^ 

lih,h' :Y ^ [0, 1), let hVh' be the joined function defined by hVh'iy) = {h{y), h'{y)). By 
analogy with our definitions of h^'^ and /i^'^, we can construct the partitions {Km V /^m)^^^ : 
Y {Dm X Dm)^^ and Qim V h'^)^^ ■ Y {Dm x Dm)^"^ by discretizing hVh' ushMy h'^- 

Proposition 2.4.12 (Copying Lemma). Let Y = {Y,C,v,S) e End{p) and suppose 
X = {X, B, ji, T) e End{p) is ergodic. Let f X ^ [0, 1) and g Y ^ [0, 1) be generating. 
Suppose f is a tree-adapted function on Y such that dist{X., f) = dist{Y, /') . For all e and 
N, there exists a function g' on X such that 

dist{U V g'j^f^ ~ dist{f^ V g^f". 
Proof: By proposition 2.4.11, we have 

dist{f'Z^) = dist{fr)- 

Choose M such that N/M < e/2. Construct a e/2-trcc Rokhlin tower M of height M + 1 
in X. Let B be the base of M. By the Strong Tree Rokhlin Lemma, we may assume that 
is independent of B. Hence, dist{f^}^\B) = dist{f'^). Let tt : Y ^ X denote the tree 
adapted factor map defined by tt = (/'^*)~^ o {f')^'. 

Fix an atom q of Jm^. Define a partition Pg : q n B ^ {Dm x Dm)'^^ such that 

dist{Pg) = dist{{f'^ V gM)''^K\q))- 

This gives a bijection (/? of the atoms of Pg and those of {J'm V 5'M)^^|7r~^(g) such that cor- 
responding atoms have the same conditional measures and are mapped to the same element 
in {Dm X DMr^". 

For each atom a e Pg, choose any representative point y e (f{oi). Note that as /m = /m°^ 
and 7r(y) e g, by proposition 2.1.4, fM^{z) = f'^{y) for all z & a. For each z & a, choose a 
tree automorphism A e Am such that fM{TvZ) = fMiSAvV) for each v e TJ^. We then define 
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g' by laying on {T~^z \ 1 < t < M} via A. (We can ensure the measurability of g' by 
choosing the same tree automorphism for all 2; e a which are in the same atom of /m"".) 
This defines the function g' on the column over a. 
Clearly we have for every z & a and v e 7^, 

/m V g'MiTyZ) = /m V guiSAvy): 

so that 

(/m V g'^f'^iz) = if^ V gMf^'iy) = P,{z). 

Repeat the above procedure for each a e Pg. We then have (/m V g'M)^'^^ = Pg on qHB. We 
thus have 

dtstiifM V g'M f^lq nB) = dtstiifj^ V gM f^K\q)) (1) 

We then extend g' to the rest of the tower by following the above procedure for each atom 
q of /m^. We thus have a function g' defined on UM\S. Extend g' measurably to the rest 
of the space in any way we like. 

We wish to prove that dist{fN V g'^)^^ ~ dist{f'j^ V gNY"^^ ■ First, let us observe that by 
our construction and by (1), we have 



dist{{fMVg'^)^''\B)= J2 l^{qnB\B)dist{{fMVg'^)''^\qnB) 

= J2 mdism'^y gM)''''\-K-\q)) 



(2) 



= dist{f'^ygMr^. 

We claim that for each < j < M - N, 

dtstiiU V g'^f^\T-^B) = distif^ V g^f"". (3) 

To see this, note that if j3 and /3' are corresponding atoms of (/m ^ q'm)^'^ (/^ ^ gm)'^^ i 
then 

distiUN V g'j,f^\T-^l3 n T-^B) = distiU'^ V g^f''\S-^l3'). (4) 



Moreover, by (2), we have 



pi{T-^P\T-^B) = MB) = u{P') = u{S-^P'). (5) 
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Thus (3) follows from (4) and (5) since the distributions in (3) are just weighted averages of 
the distributions equated in (4). As N/M < e/2 and the rest of the space is a set of measure 
< e/2, we see that 

distiU V g'X'' ~ distif'^ V gj.f''. 

This finishes the result. ■ 

For Y = (y, C, S) e End{p) and partitions P, i? : y — > C for a finite set C, we write 
P~i? if {?/ e Y I P{y) ^ Riy)} has measure < £. For two partitions P : Y ^ C and 

s 

Q : Y ^ C , we write P cQ ii there exists a partition R -. Y ^ C with R < Q such that 

Given a function g -. Y ^ [0,1), let X^lfi*) denote the sub-sigma-algcbra of C generated 
by g, i.e. J2i9) = \IT=o^~^ ^9~^(^)) ^'^'^ Borel sigma-algebra V of [0,1). If gf and h are 
functions on y, we write h C X/(9') if there exists some integer M such that C S'm ■ We 

e,JV 

write /i C if /i is X](fl')-iii6asurable. Clearly, h C X](5') if and only if /i C X](5') for all 

e and A?". 

For the remainder of this section, let X = {X, B, ji, T) and Y = (Y, C, v, S) be tvwB 
p-endomorphisms. Moreover, we fix generating functions / : X — » [0, 1) and g : Y [0, 1). 
Before delving into details, we briefly describe the strategy to proving Theorem 2.4.1 from 
this point on. The idea is to construct a Cauchy sequence of functions {g'^} on Y such 
that dist(Y,g^) = dist(X., f ) for each j, and which converge to a function ^ on y such that 
g C J2i9)- Thus dist(Y,g) = distQ^, f ) and J2i9) = ^12(9) = ^- Now, the sigma-algebra B 
pulls backs to "^{g) via the factor map tt = {f^')~^ o g^' : Y ^ X as constructed prior to 
Proposition 2.4.11. It thus follows that tt is an isomorphism as 7r~^ gives an isomorphism of 
(7-algebras of Lebesgue spaces. As we have previously remarked, B~^{p) is tvwB. Theorem 
2.4.1 thus follows. 

For technical reasons, we will choose the generators / and g such that {x e X \ f{x) = q} 
is a null set for each rational q, and similarly for the sets {y & Y \ g{x) = q}. Observe that 
we can always do this as X and Y are nonatomic Lebesgue spaces. (Just choose any point 
isomorphism / : X [0,1) and g : Y [0,1).) This assumption is needed to ensure that 
for the functions we construct via the copying lemma, a large enough set of points assume 
values sufficiently bounded away from the boundary points in the dyadic intervals of length 
1/2^ in Pj^. For any such function /i, if |/i — /i| is sufficiently small, then h^ix) = hN{x) for 
all x in a set of large measure. 
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Proposition 2.4.13. Let f :Y ^ [0, 1) be a tree-adapted function such that dist{Y, /') = 
dist(X., /). Then for any rj > 0, e > and M e N, there exists a tree-adapted function 
/ : y ^ [0, J) such that 

e,M „ - 

a) Eif) 

b) \J-f'\<V 

c) dist{Yj) = dist(XJ). 

Proof: Let e N be specified later. By Proposition 2.4.12, for any S and L, we have some 
function g' on X such that 

dtstih V g'^f^ - dtstifl V gi^f"". (1) 
As /' c'Eig), for each ^ > 0, we can choose some k such that 

Now, by our choice of g, {y E Y \ g(y) = g} is a null set for each rational q. Thus, for 
any > 0, there exist open intervals containing the rationals {t/2'' | t = 1, . . . , 2*" — 1} such 
that g{y) assumes a value in one of these intervals on a set of measure < /3. Thus, (1) also 
implies that g'{x) assumes a value in one of these intervals on a set of measure < 2/? for all 
sufficiently small 5 and 1/L. If /3 > is small enough, we may choose p > such that for 
any function g with \g' — g\ < p, g'kix) = gk{x) on a set of sufficiently large measure to give 

distih V g'^f"" ~ distiU V -gkf''. (2) 

From (1), dist{g''^)'^ dist{g^). By the Strong Sinai's Theorem, if 5 and 1/L are chosen 
small enough, then we may choose g with \g' — g\ < p, dist(X., g) = dist{Y, g), and (2) holds. 

Once more by proposition 2.4.12, for any L' and 5', we have some function fonV such 
that 

distih^ V gr^^f^ ~ distih' V gr^^f^. (3) 

Using the Strong Sinai's Theorem again, for any < p' < we may choose 6' and L' to 
give a tree-adapted function f onY such that | / — / | < p' < ^ with dist{Y, f) = dist(K, /). 
This gives (c) of the proposition. 

Choose A;' e N such that gMCf^,~^. By (3) and decreasing 6' and 1/L' if needed, we 
have gM C f^','^. Once again, since {x E X \ f{x) = g} is a null set for each rational q, we 
have gM C f^i^ by choosing p', 5' and 1/L' small enough. This gives (a) of the proposition. 
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It remains to prove (b). Now, 

I/' - /I < I/' - /;i + 1/; - h] + \fN -f\ + \f-j\ 
< 1/2^ + 1/;-/^! + 1/2^ + ^. 

Using (1), (2) and (3) and ensuring min(L, L') > k, and 5, 5' small enough we have 
Thus, 

IB 

Since Cgk''^, we may also conclude that \f'j^ — }n\ < 36*. We may now conclude b) by 
choosing 9 and in the beginning to satisfy 49 + l/2^~^ < rj. M 

We are now ready to prove Theorem 2.4.1. 

Proof (Theorem 2.4.1): The idea is to use Sinai's Theorem and Proposition 2.4.13 re- 
peatedly to construct a Cauchy sequence of functions {g^} on Y converging to some func- 
tion g pointwise a.e., and dist{Y,g^) = o?isi(X, /) for each j, from which it follows that 
dist(Y,g) = dist{X.,f). We now need to ensure that the functions {g^} are chosen in such 
a way that g C ^{g)- Prom this, we may conclude that ^{g) = '^{g), thus concluding the 
proof of Theorem 2.4.1. 

To carry out the above plan, choose a sequence of reals Sj \ and a sequence of integers 
Mj y oo. Let rjj \ be a sequence of reals, to be specified later. Using Sinai's Theorem, we 
begin by choosing a tree-adapted function g^ on Y such that dist{Y, g^) dist(K, /). Using 
proposition 2.4.13, we construct a sequence of tree-adapted functions {g'} on Y, j > 1, such 
that 

i) dist{Y,g^) = dist(KJ), for j >1 

ii) - 9^^^ < Vj: for j >1 

iii) 9 C forj>l- 

By ii), we may obviously arrange the Ty^'s such that the gi^'s converge to a function g pointwise 

ej,Mj 

a.e. Thus dist{Y,g) = dist(X.,f). Since g C '^{g^), for j > 1, we have some integer kj 
such that 

Now as dist{Y, g^) = dist(X., f) for each j, the fact that {x E X \ f{x) = g} is a null set for 
each rational q implies {y E Y \ g\x) = q\ is also. Hence, we have some ^„ > for each 
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n > 1 such that whenever 



oo 



J2\g'-9'^'\<0., 



(1) 



we have 




2e, 



For any m e N and £ > 0, it follows that for all large n, 



9 c Eig)- 



Wc thus have ^((/) = ^((?), which gives Theorem 2.4.1. 

Hence we are done if we can choose the g^^s to satisfy the inequalities specified in (1) for 
all n > 1. To see that this is possible, note that they are chosen in the order ^ ^ . . . . 
In view of this, when ^„ is determined, only g^, . . . ,g" have been chosen and g"'^^ has not 
been chosen yet. Now, rjn only need to be chosen when we choose g"'^^. As a result, rjn has 
not been declared at the time we specify 

Thus, we may choose rjn < min(^, . . . , ^), it follows that 



Thus the inequalities in (1) can be satisfied. This completes the proof of Theorem 2.4.1. ■ 

2.5. An Elementary Proof of a Special Case of Theorem 



In this section, we give a direct proof of Theorem 2.4.1 in the case that p = (Pi, • • • 
is a probability vector such that pi > . . . > Ps. A simplified proof exists in this special case 
because the group A of tree automorphisms is trivial. 

Let X = {X,B,iJi,T) e End{p). Let / = {1, Consider the one-sided Bernoulli 

shift -B+(p) represented as the shift space {I^\C, v, S) such that the j-th symbol has weight 
Pj. We then have a canonical factor map : X — > -B+(p) defined by setting ^(x)^ = j if 
Px{T''x) =pj, for A; > 0. 

Recall that a node of the p-tree T is a finite sequence of integers in {1, . . . , s}. For each 
node V and a point z in I^' , let vz be the point in I^* obtained by concatenating v to the 
left of z. For each z in I^' , let z[0, m] be the cylinder set 




2.4.1 



{z' e 7^* I Zj = Zj for 0<j< m}. 
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Our goal is to prove the following special case of Theorem 2.4.1. 

Theorem 2.5.1. Let p be a probability vector with pairwise distinct components. Let 

X = {X,B, fi,T) G End{p) be tvwB, then the canonical factor map : X — -B+(p) is an 
isomorphism. 

To prove this, we need a prehminary lemma. Recall that for X = (X, B, //,T) and 
Y = {YjCjiy, S) , if : X — > Y is a factor map, then we have fiber measures /ly supported 
on ip~^{y) with the property that /i = J /ly dv{y). 

Lemma 2.5.2. Consider the one-sided Bernoulli shift B^{p) = (J^^*, C, z/, S*). Let X = 
{X, B, fi,T) G End{p). If D C X and v eT, then for a. a. z in I^' , ^^^(TyD) = Hz{D) {i-c. 
T'^l : {X, iiyz) —>■ {X, Hz) is measure preserving). 

Proof: This follows easily from the fact that 

fi{T,B\T,C) = fi{B\C) 

for positive measurable sets B C X and C Q X. M 

Proof (of Theorem 2.5.1): Let / : X — > [0,1] be generating. By assumption, (X,/) is 
tvwB. It suffices to prove that for a.a. z in I^' , there exists some r G [0,1] such that for //^-a.a. 
X, f{x) = r (i.e. / is constant on fibres). Indeed, this implies that / is 0~^(C)-measurable. 
Since / is generating, wc thus have B = (f)^^{C) and so (f) is an isomorphism. 

Wc proceed by contradiction. Hence, we suppose that there exist r] > 0, a set Z C I^' 
of positive measure and two disjoint intervals J and J' in [0,1] separated by a distance of at 
least T] such that for each z & Z, /i^ix & X \ f{x) & J) and //^(a; G X | /(x) G J') are both 
at least rj. We may clearly assume rj < v{Z). By the Tree Ergodic Theorem, there exists 
N such that for all n > N, we have a set X C L^' of measure at least rj such that for 
each z E K, 



1 J^iZ) ri 

{0<\v\<n\vzeZ} 



We now show that the integral 



/ 



tnir;[,T^)dHzix)dHziy) 



is bounded away from zero for z E K, n > N. To see this, for v E T, by Lemma 2.5.2, 
J \r^{v)-T;{v)\ dHz{x)dnM = J \f{nx)-f{ny)\ 

l/(^) - f{y)\ diiyz{x)diiyz{y)- 



I 
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Hence, ii z E K ior n > N , then from (1) and the fact that A consists of only the identity 
automorphism, 



0<\v\<n 

1 

> 



- \f{T^x) - f{T^y)\ dii^{x)dii^{y) 

{0<\v\<n\vzeZ} 

= - XI I \f{x) - f{y)\ dn^^{x)dn^^{y) 

{0<\v\<n\vz&Z} 
n ' ^ 



n 

{0<lu|<n|i>z6Z} 

where the second last inequality follows from the fact for each vz in Z, we have two disjoint 
sets U and U' with measures at least r] and \ f{x) — f{y)\ > rj whenever x E U and y E U'. 
Hence, 

J Uri,T^)dfx,{x)dfxM>vV'2 

for all ^ e n > N. 

We will now use the fact that (X, /) is tree very weak Bernoulli to arrive at a contra- 
diction. Let < £ < 1 be specified later. Choose n corresponding to £ in the definition of 

tvwB for (X, /). We may assume that n > N (for the N chosen in the last paragraph). By 
definition, we have a set G of measure > 1 — e such that for all x and y in G, t„(r^, r^) < s. 
Now, we have 

/x(G"=) = J fx,{G') dv{z) < e. 

Hence, 

for all ;2 in a set G' C of measure > 1 — \/£. Consequently, we see that for each z in G', 

tn{rl, r/) dfx,{x)dfx,{y) <e + 2y/e. 

Choosing e > sufficiently small ensures that z/(7^ fl G') > 0. Moreover, choose e such that 
s + < ?7'*/2. Now, if z E K n G', we have on the one hand, 

J UT^,T^)dfX,ix)dfxM<vV'2 

as z E G', while 

J UT^,T^)dii,{x)diiM>vV'2 
as z E K. This is a contradiction and thus completes the proof of the theorem. ■ 
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Chapter 3: 

A Joinings Proof of the Isomorphism 
Theorem 

The goal of this chapter is to present a joinings proof of the isomorphism theorem, The- 
orem 2.4.1, in the previous chapter. The proof is a modification of Hoffman and Rudolph's 
([H,R]) proof of the isomorphism theorem in the case of the uniform probability vector. 
While the joinings proof is more technical in certain aspects than the proof presented in the 
previous chapter, it has the advantage of showing that there are uncountably many auto- 
morphisms of S+(p), unless the components of the probability vector p are pairwise distinct 
(see Proposition 3.4.2). Most of the definitions and theorems below are modelled after Hoff- 
man and Rudolph. There are, however, two modifications that we will make to their proof 
which will allow us to extend their arguments to the general probability vector. First, we 
will need to extend the definition of one-sided joinings introduced in [H,R]. In particular, 
we need to impose an additional condition in the definition of a one-sided joining which is 
trivial when p is uniform. Second, our statement of the copying lemma will differ from that 
in Hoffman and Rudolph ([H,R]'s Lemma 5.4). In our presentation, we will need to copy tree 
distributions induced by the functions on the p-endomorphisms under consideration. This 
approach will save us from tackling the technical issue of whether dist and tdist generate 
the same topology on tree-adapted functions on p-endomorphisms for a general p ([H,R]'s 
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Lemma 3.6). 

This chapter is organized into four sections. In §3.1, we define the notion of a one-sided 
joining of two p-endomorphisms and discuss some topological properties of such joinings. 
§3.2 is devoted to the proof of the copying lemma, which is similar in form to Proposition 
2.4.12 but it is more involved. §3.3 examines the t distance between two processes, as 
defined in [H,R]. §3.4 contains the proof of Theorem 2.4.1, using the machinery of one-sided 
joinings developed in §3.1 to §3.3. The idea is to show that for a tvwB p-endomorphism 
X = (X, /jL, T) and for -B+(p) = (F, C, S), the set of one-sided joinings A such that B = C 
(the isomorphic joinings) is a dense Gs in the space of one-sided joinings, which is a compact 
metric space in the ty*-topology. The Baire Category Theorem then imphes that the set of 
isomorphic joinings is non-empty, provided that the space of one-sided joinings is non-empty 
(which will indeed be the case). However, each such joining gives an isomorphism and so 
X ^ B+{p). 

3.1. One-sided Joinings 

Let p = (pi, . . . ,Ps) be a fixed probability vector such that pi > ... > Ps- Let po = 1 
and assume that = Sq < Si < . . . < Sr = s are chosen such that for all < z < r — 1, 
Psi+i = . . . = Ps^^-, and Ps. > Ps-^^. Define the probability vector p by summing the identical 
components of p, i.e. 

Si Sj. 
i=l Z=Sr-l+l 

Let /(p) = {1, . . . , r}^' and assign j with weight equal to the j-th component of p. Construct 
the one-sided BernouUi shift -B+(p) = (7(p), m, cr) where m is the product measure and a 
is the shift defined in the usual way. Given X = {X,B,iJt,,T) e End{p), define a map 
■ X ^ Hp) by setting z/'xla^)* = j if Py^iT^x) = p^.. Note that if p is a uniform probability 
vector, ipy^ is a constant map into a single point system. For two m.p.s. X = (X, B, ^, T) and 
Y = (y, C, z/, S") , a coupling of X and Y is a measure A on the product space (X x F, i3 x C) 
such that \{B xY) ^ fi{B) for all C X and A(X x C) = u{C) for aU C C F (i.e. A has 
marginals // and u). A joining of X and Y is a coupling A which is also T x (S-invariant. 
For brevity, for a set S C X, we will also use B to denote the subset B xY in the product 
space X x y, with similar convention for a set C C y. (It will always be clear from the 
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context whether B C. X refers to B or B xY.) 

Two examples of joinings are worth mentioning at this stage. First, the product measure 
/I X 1/ is always a joining of X and Y. Second, the diagonal measure Xa on X x X defined 
by Xa{B X C) = Ijl{B n C) for any B <Z X and C C X is a self-joining of X. 

Definition 3.1.1. For X = (X, B, T) and Y = (Y, C, u, S) in End{p), a one-sided cou- 
pling (joining) A of X and Y is a coupling (joining) of X and Y such that 

i) for any pair of compact-valued generating functions f : X ^ R and g : Y ^ U, if 

= / o T' and g'' = g o S\ then for each j > 0, we have 

Dist,i{f%^,^j\{r}j^„ {^'},<,) = Dist,{{f%^,^j\{f}j^,) 

and 

Dist^i{g'}o<i<j\{f}j<i, {g'}J<^) = Distx{{g'}o<,<j\{g'}j<i) 

ii) A projects to the diagonal measure on 7(p) x 7(p) via the map ■^x x V'y : -'^ x ^ — 
7(p) X 7(p). 

Some remarks on definition 3.1.1 are in order. First, condition i) holds if the equations 
hold for some pair of generating functions. Indeed, note that the equalities in i) hold if and 
only if 

Dist,{{f%^o\{rh<i,{9'h<i) = ^^s^A({/'}i>o|{f L<i) 

and 

Dist,i{g'},>o\{r}j<i: {9'}j<i) = Dist^i{g'},>o\{g'}J<^)■ 

Since the sigma- algebras generated by {P}i>j and {g'^}i>j for any j > do not depend on 
the choice of the generators, condition i) is just the Hoffman and Rudolph definition ([H,R]'s 
Definition 3.2). In the case that A is T x ^'-invariant, condition i) essentially says that for 
A-a.a (x, y), if x' is a preimage of x, then the conditional probability of x' given {x, y) is just 
the conditional probability of x' given only x. Condition ii) essentially says that for A-a.a 
(x, y), the px-name of a; = pY-name of y. Thus if p is a uniform probability vector, condition 
ii) always holds and Definition 3.1.1 is just the Hoffman and Rudolph definition. 

For brevity, given X, Y e End{p), let C(X, Y) and C~^(K, Y) denote the set of couplings 
and one-sided couplings of X and Y. Similarly, let J(X, Y) and J+(X, Y) denote the set 
of joinings and one-sided joinings of X and Y. By viewing the set of probability measures 
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on {X xY,B X C) as a subset of bounded linear functionals on the continuous functions on 
X X y , we have a metrizable topology, the u'*-topology, on the set of couphngs and joinings. 
It is standard that C(X, Y) and J(X, Y) are i(;*-compact. 

We will now establish some basic facts of one-sided couplings and joinings which will be 
used in subsequent sections. 

Proposition 3.1.2. C+(X, Y) and J+(X, Y) are w*-closed convex subsets of C(X, Y) and 
J(X,Y) respectively. 

Proof: The statement that J+(X, Y) is a w*-closed convex subset of J(X, Y) easily follows 
from the corresponding statement for C+(X,Y) and C(X,Y) and the fact that the set of 
joinings is a closed convex subset of all couplings. Hence, it is enough to prove that (^"""(X, Y) 
is a it;*-closed convex subset of C(X, Y). 

For convexity, notice that condition i) of Definition 3.1.1 says that for each t > 0, each 
one-sided coupling couples B and S~*C independently when conditioned on T~*B. As each 
one-sided coupling projects to /i on B, this implies that conditionally on T~*B, each one- 
sided couphng can be viewed as a product measure of the form fj, x Ui on B x S~*C. It 
follows that any convex combinations of one-sided couplings is still a product measure of 
this form on B x S~*C conditionally on T~*B. Thus any convex combinations couples B and 
iS^'C independently over T^^B. By symmetry, C and T^'B are also coupled independently 
when conditioned on S^^C. Thus, condition i) holds for convex combinations. The fact that 
condition ii) of Definition 3.1.1 also holds for convex combinations of one-sided couplings is 
obvious since any convex combination of measures projecting to the diagonal measure on 
/(p) X 7(p) also projects to the diagonal measure. Hence, one-sided couphngs are convex. 

We prove closure. Fix i e N. Let Aj e C+(X, Y) be a sequence of one-sided couplings 
such that Aj — > A in w*. Clearly, A e C(X, Y). Suppose P is a finite partition of X, and Q 
is a finite partition of Y which is 5'~*C-measurable. Choose finite partitions Rj of X such 
that Rj / T~*B. For condition i) of definition 3.1.1, it suffices to show that for each atom 
ae P, 

E^{a\T-'By Q) = E^{a\T-'B) a.e. 

Indeed, this implies conditionally on T'^'B, A couples P and Q independently. Since Q and 
P are arbitrary, condition i) of Definition 3.1.1 follows. 
Note that for each s e N and a E P, 
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Ex, {a\ V Rj y Q) ^ Ex{a\\/ Rj V Q) a.e. 

and 

Ex,{a\ V Rj V Q) > Ex,{a\T-'B V Q) = Ei,{a\T-'B) a.e. 

as Xi is one-sided. Thus, we have Ex(a\T^*-B V Q) > E^{a\T^'^B) a.e. by the Martingale 
Convergence Theorem. Since T"*i3 is a sub-a-algebra of T^^B V Q, the reverse inequahty 
also holds so that condition i) of Definition 3.1.1 is proved. For condition ii), note that for 
each cylinder set c C J(p), (-^x X '?/'y)~^(c x c) is a set of the form B x C, where B C. X and 
C CY. Since Aj ^ A in w*, we have Xi{B x C) ^ X{B x C). Hence, as 

Kii^x X V'y)"'(c X c)) = m(c), 

the same holds for A so that A also projects to the diagonal measure on /(p) x /(p). ■ 

The following proposition, which extends Lemma 3.7 in [H,R] to p-endomorphisms, gives 
an example of a one-sided joining. Given a factor map : X — > Y, yields a probability 
measure A^ on X x y defined by X^{B x C) = ii{B n 0"^C) for S e B and C e C. It is easy 
to check that A^ G J(X, Y). We say that A^ is the graphical joining induced by 0. 

Proposition 3.1.3. Suppose X = {X, B, ji, T) and Y = (Y, C, v, S) axe ergodic p-endomor- 
phisms. Let : X — > Y be a tree-adapted factor map. Then X^ e J+(X, Y). 

Proof: Note that for any t G N and any node u of length t, the conditional probability 
of Ty^x given x and (f){x) equals that of T^x given x, as A^ is the graphical joining arising 
from the factor map (p : X ^ Y. On the other hand, note that as is tree-adapted, for 
any x G 0~^(y), there is a unique node v of length t such that 0(T„x) = Clearly, the 
conditional probability of 8^1/ given x and y equals that of T^x given x. By Proposition 
2.1.3, since is a tree adapted factor map, Wu = Wy. Thus, the conditional probability of 
S^y given x and y equals that of S^y given y (in fact, both are equal to Wu). Since A^ is 
stationary, it follows that condition i) of Definition 3.1.1 of one-sided joinings holds for A^. 
By Proposition 2.1.3, for any cylinder set c in /(p), 0"^('0y^(c)) = i'x^ic)- Thus, 

A^(V'x'(c) X (c)) = l^{^i\c)) = m(c). 

This imphes condition ii) of definition 3.1.1. Hence A^ G J+(X, Y). ■ 
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Note that Proposition 3.1.3 implies that if X = {X, B, ji, T) is a p-endomorphisms, then 
J+(X, S+(p)) is non-empty. Indeed, we can easily construct a tree-adapted factor map 
01 : X — > S+(p) by choosing a tree partition of X and mapping a point x to its 
Xx-name. By Proposition 3.1.3, the graphical joining arising from this factor map gives a 
one-sided joining. 

3.2. The Copying Lemma 

In this section, we will establish the copying lemma for one-sided joinings which will be 
the key ingredient in the proof of Theorem 2.4.1. 

Let X and Y be p-endomorphisms and let A G J^(X, Y). For (x, y) E X xY , and a pair 
of nodes (v^u) of the same length j, let p^ y{v,u) be the conditional mass of {T^x, SuV) given 
{x, y). By condition i) of Definition 3.1.1, we have Px,yiv, u) = Wy and J2\v\=j Px,yiv, u) = 

Wu for A-a.a. (x, y). Moreover, if ^ w„, then Px^y{v,u) = for A-a.a. {x,y) by condition 
ii) of Definition 3.1.1. For a tree automorphism A and two nodes v and u, let A{v,u) = 
if It = Av, and A{v, u) = otherwise. 

Before we state the next proposition, we recall the following well-known theorem which 
states that any doubly stochastic matrix (i.e. a square matrix such that the entries in each 
row and column sum to one) is expressible as an average of permutation matrices (i.e. square 
matrices such that each row and column consists of precisely a single entry of 1 with the 
rest O's). The proof is based on Hall's Marriage Lemma and can be found in many standard 
combinatorial texts (eg. [Ryd]). 

Proposition 3.2.1. Let M be a doubly stochastic matrix of order n. Then M can be 
expressed as a convex combination of permutation matrices, i.e. there exist nonnegative 
reals Ci, . . . , q which sum to one, along with permutation matrices Pi, . . . , such that 

M = c^P, + --- + CtPf 

Remcirk: If in Proposition 3.2.1, M is a matrix such that the entries in each row and 
column sum to some fixed number a, then the conclusion still holds with Pi, . . . , Pj replaced 
by "permutation" matrices where each nonzero entry is a. 

Proposition 3.2.2. Let A e J+(X,Y) and N eN. Then for A-a.a. {x,y) e X xY, we 
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have a probability measure m^^y on An such that for any 1 < j < N, if \v\ = \u\ = j, then 

Px,v{v,u) = j A{v,u)dm:,^y{A). 

Proof: Fix {x,y) & X xY. We prove the result by induction on N. First, suppose N = 1. 
Let V be the set of nodes of length 1. Define a measure p on y x y by setting 

(v,u)eu 

for [/ QVxV. We wish to apply Proposition 3.2.1 to construct a measure on Ai from p. 
To do this, recall that each node of length 1 is an integer in {1, . . . , s}. We can represent the 
measure p as a s x s matrix M whose columns are indexed by 1, . . . , s such that M„„ = p{v, u). 
By the one-sidedness of A, note that M is a block diagonal matrix such that for each block, 
the entries in each row and column have the same sum (in fact, the sum for the j'-th block 
is just Psj)- For each of the blocks Af,, we can apply the remark following Proposition 3.2.1 
and express it as a convex combination of permutation matrices. Doing this block by block, 
gives a decomposition of M in the form 

i 

M = ^a„Q„, (1) 

where the a„'s sum to 1, QnS are distinct block diagonal matrices such that for each 
the j-th block is a permutation matrix with each nonzero entry being p^., and the sum 
is taken over all such possible matrices. We may then use this decomposition on M to 
define a measure on Ai as follows. For A & Ai, choose the unique matrix Qn such that 
{Qn)vu = A{v,u) for all v,u E {1, . . . ,s}, and set m(/l) = a„. Then (1) immediately implies 
that 

p^,y{v, u) = p{v, u) = M^„ = j A{v, u)dm{A). 

Next, assume that the result holds for N = t. We wish to build a measure on At+i 
such that the asserted equality in the statement of the proposition continues to hold for 
nodes of length < t + 1. Notice that each tree automorphism in At+i is defined by a tree 
automorphism in At combined with a collection of tree automorphisms in Ai indexed by the 
nodes of length t. To define the required measure m^+i on At+i, we proceed as follows. Fix 
A' e At+i and let A denote the restriction of A' to %. For each node |^;| = t, consider the 
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pair of points (T^x, SavV)- Using the basis case, we have a measure m„ on Ai such that for 
any \u\ = \u'\ = 1, 

PT^x,SA.viu,u') = J A{u,u')dmy{A). 

We then define 

m{A') = Yl m,{B,)m{A), 

\v\=t 

where By is the tree automorphism in Ai induced by A' on the trees of height one rooted 
at V and Av. A simple calculation using the basis case and the induction hypothesis when 

N = t shows that for every pair of nodes {v, u) of common length <t + l 

Pa:,y{v,u)= J A'(v,u)dmiA'). 
A'eAt+i 

■ 

For a finite set C and partitions P X ^ C and Q Y ^ C, define the joined 

partition P®Q: XxY-^CxChyP® Q{x,y) = {P{x),Q{y)). If A G J(X, Y), then 
A induces a stationary measure Ap^g on the shift space {C x C)^* via the map {x^y) — > 
(P ® Q{T'x, S''y))i>o. Clearly, we can extend Xp^q to a stationary measure on (C x C)^ and 
then restrict it to a measure on (C x C)~^ . 

For any e N and j > N, and a pair of elements (a, /3) in C^, let {a x denote the 
cylinder set 

{ze{Cx C)-^ I z, = -j <t<-j + N- 1}. 

Given A & A and tree names h, h' : T' ^ C, we then define a measure X{h,h',A) on the 
cyhnder sets by setting X(h,h' ,A){i<^ x P)~^) to be the total weights of all nodes \v\ = j such 
that 

{h{v),h{a{v)),...,h{a''-\v)))=a 

and 

{h'{Av), h'{a{Av)), . . . , h'{a^-\Av))) = (5. 

The following proposition states that we can represent Ap^g as an average of measures of 
the form A^^p^q ^i). This will be used to prove the copying lemma. 
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Proposition 3.2.3. With the notations above, for X e J"'"(X, Y) and partitions P : X ^ C 
and Q :Y ^ C for a finite set C, we have for each N eN, a family of probabihty measures 
rrix^y on An such that 



Xp^Qiia X /?)-^) 
for each pair of elements {a, j3) in . 

Proof: Abbreviate the cylinder set {a x as {a x P). Notice that as A is T x ^'-invariant, 



we have 



-I ^ 



><p^Q{oixP)= > p^^y{v,u)dX{x,y) 



By Proposition 3.2.2, we have a measure m,x,y on such that 

Pa:,y{v,u) = J A{v,u)dm^^y{A) 
for all pairs of nodes v and u of length A^. We then have 

>^p^Q{oixP)= Px,y{v,u)dX{x,y) 

J |„| = |u| = JV 

p''+(T„x) = q,q"+(S„!,)=/3 

-I ^ I 

^ Uil — U,l— AJ 



A{v, u) drrix^yiA) dX{x, y) 



|t>| = |u| = W 
p"+(T„a,)=c.,Q"+(S„„) = 





A{v,u)dm^^y{A)dX{x,y) 

\v\ = \u\=N 

\T^,r^,A){<^ X P)dm^^y{A)dX{x,y). 



We will now prove the copying lemma for one-sided joinings. Note that while it appears 
to be more general than the copying lemma in [H,R] (Proposition 5.4) in that we also copy 
distribution of tree names, the proof is essentially the same. Following the convention in 
Chapter 2, we will work with functions taking values in [0,1). 

Proposition 3.2.4 (Copying Lemma). Suppose X = {X, B, ji, T) and Y = (Y, C, v, S) are 

ergodic p-endomorphisms. Let X e J+(X, Y). If g : X ^ [0, 1) and h :Y ^ [0, 1), then for 
all e and N, there exists a function g onY such that 

\distig^^) - distig^^)\ <e 
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and 

\dist{g^+ ® h%+) - dist{g^+ V h^+)\ < e. 

Proof: Let g g^"^ V gN and h = h^^ V and let D denote (L'jv)^^ x (Recall 
that Djv is the set of midpoints of the dyadic intervals {t + l)/2^).) Choose M > N 

such that 2N/M < e/2. Construct a s/2-tree Rokhhn tower M of height M + 1 in Y. By 
Proposition 3.2.3, we have measures rrix^y on Am such that for each N < j < M and for 
each pair of elements (a, (3) in D'^ , 

X-g^f,{{a X p)-') = J j \^s^^u^A){{ax i3)-')dm,^y{A)d\{x,y). (1) 

Consider the partitions g'^'' : X — > ^^id h'^^ : Y D'^^ , we can assign a measure 
A to each atom of the partition x h^^'' x Am of X x F x Am by setting 

Xipxqx A)= / m^^y{A) dX{x, y). (2) 

J pxq 

By the Strong Tree Rokhhn Lemma, we may assume that the base C of M is chosen such 
that dist{h^^\C) = distQi^^). For each atom 7 e we define a partition 

: 7 n C ^ D^" X l)^" X Am 

such that 

(iist(P^) = \{g^^ X h^^ X Am\i)- 
The partitions P^ over all 7 collectively define a partition P of C such that 

ciisi(P) = dist{g^^ x h^^ x Am)- 

This gives a bijective correspondence p of the atoms of P and those of g'^'" x h^'" x Am such 
that if q; G P and = (/?,7,74), then « C 7, and u{a\C) = X{p{a)). 

We will now construct the required function g on the tower. Fix an atom a & P such 
that p{a) = (/3,7, A). Choose any point x E p. We define the function g on Uj^^ 5'~*q; by 
setting g{z) = g{TA-ivX) for z e S'^o;. By repeating this procedure for each atom a e P, we 
can extend ^ to a function on UM\P. Extend the function g to the rest of Y in any way we 
like. 

Set § = g^^ y gN- For an integer j such that N<j<M — N,we claim that 

dist{g''+ ® ^^+) = distif^ V h''+\S-^C^) (3) 
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If q; e P and p{a) = 7, A), then by our construction, notice that 

{gN{Syy), hM{S^y)) = {gN{TA-ivX), hNiS^y)) 
for any y & a and x & (3 and v e 7^. We thus have 

iSiSvy), kSvy)) = {9(Ta-ivX), h{Svy)) (4) 

for V e r^_^. 

Let Ci V C2 = {y e I ^^^(y) = d, h^^iv) = (2}- Then (4) imphes that for any x e /3 
and y e 7, 

i/(Ci V C2|-5-^a) = A(,|,,.,^)((Ci X C2)-^). (5) 
Note that (1) and (2) imphes that for e (3 and y(7) e 7, 

Hence, since u{S~^a\S~^C) = u{a\C) = A(/3,7, (5) and (6) imply that 

It follows that for < j < M - AT, 

which is (3). 

As i^(U]^^^ S-^C) > 1 - £, we then have 

dist{~g''+ - distif^ V < £, 

from which the conclusion easily follows, since g refines the partitions gN and g^'^ (and 
similarly for g and h). ■ 

3.3. The t Distance 

Suppose X = {X,B,iJL,T) and Y = {Y,C,v,S) are p-endomorphisms. Following [H,R], 
for a fixed m e N and functions : X — > [0,1) and /i : y — > [0,1) respectively, define the tm 
distance between the pair of processes (X, g) and (Y, h) by 

i„((X,y),(Y,/i)) = , inf / ^„(r,^r^(iA. 



AeC+(X,Y) 
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Let 

t{{X, g), (Y, h)) = liminf ?„((X, g), (Y, h)). 

Note that this definition of t„((X,5f), (Y, hj) differs from the definition in §2.4 in which we 
consider only the product joining A. Nonetheless, note that Proposition 2.4.6 (which will be 
used in §3.4) will also hold with the present definition. In fact, for a tvwB X and e > 0, ii 
dist{g^^) is sufficiently close to dist{h^^) for some A^, the proof of Proposition 2.4.6 shows 
that J tm{T^, Ty) dX < e for all large m for any couphng A (not just one-sided). 

Our goal in this section is to prove the following proposition which will be combined 
with the copying lemma in the previous section to give a joinings proof of Theorem 2.4.1. 

Proposition 3.3.1. Suppose t((X, g), (Y, h)) < e, then there exists A G J"'"(X, Y) such that 
J\g{x)-h{y)\ dX<s. 

The proof of Proposition 3.3.1 will depend on following proposition whose proof will be 
postponed until the end of this section. 

Proposition 3.3.2. For each m e N, there exists a A e C+(X, Y) such that 

- / TlaiT'x) - HS^y)\ dX{x,y) <iU(^,g),{Y,h)) + l/2--\ 

Proof (of Proposition 3.3.1): By Proposition 3.3.2, for each m G N, we have some 
Xm e C+(X, Y) such that 

^ « m—l 

- - dX^^^.y) < t„((X,5), (Y, h)) + 

i=0 

Choose an increasing sequence of integers ni, n2, . . . such that 

i„„((X, g\ (Y, h)) ^ ?((X, g\ (Y, h)\ 
so that by passing to a further subsequence if necessary, we have some L < e such that 

— / laiT'x) - HS'y)\ dX^^{x,y) ^ L. (1) 

1=0 

For i > and any measure A on X x y, let (T x SyX denote the measure defined by 
(T X SyX{D) = A((T X S)-'D) ior DCX xY. Define 

A™ = — ^ {TxSyX^^. 
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Since the set of one-sided couplings is convex, e (^^(X, Y). For any measure Xon X xY 
and i > 0, let 

Xig'Ah') = J \g{Tx) - h{S'y)\ dX. 



Then, 



-1 n-m — ^ 

A„(/A/i°) = — J] (T X SyXr^^ig'Ah') 



1=0 



Let A* be any w*-limit point of the A^'s. Since C~^(K,Y) is w7*-closed in C(X,Y), A* e 
C+(X, Y). It is clear that A* is stationary so A* e J+(X, Y). By (1) and (2), given 5 > 0, 
we have for all sufficiently large m, 

A*(^°A/i°)-A„(^°A/i°)~L. 

Hence, X*{g°Ah°) = L < e, since 5 is arbitrary. ■ 

We now turn our attention to the proof of Proposition 3.3.2. The proof will require the 
construction of a one-sided coupling of X = (X, B, /x, T) and Y = (F, C, S) , which we now 
describe. Given functions g : X ^ [0,1) and h : Y ^ [0, 1), and m G N, we have the joined 
partition g^'^ /i"^ of X x F defined by g^'^ h'^^{x,y) = For an atom 

l3 G g"^^ ® /i™^, let Afj denote a tree automorphism in Am which realizes tm('?"f", t^") for any 
(x, y) G 13. Set A^^y = A^ for any (x, y) G p. 

For each (x, y) G X x y , we define a measure supported on {T x y), which we will 

denote as A^y, in the following way. For any measurable set D C X x y, let A^y{D) be the 
total weights of the nodes v of length m such that {T^x, Sa^^^vV) ^ D. Given A G C+(X, Y), 
define the measure A on (X x Y, B x C) by setting 

X{D) = J A2y{D)dX{x,y) 

ioT D C X xY . It is straightforward to check that A defines a probabihty measure on X x y . 
We now show: 

Lemma 3.3.3. AgC+(X,Y). 

Proof: First, we need to see that A is a couphng. Let {ii^} be a disintegration of induced 
by the factor map T™ : X ^ X. li B CX, then A'^JB) = fjL^{B). Thus, 

X{B) = J Aly{B)dX = J iiAB)dfx = ii{B) 
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and so A projects to // on X. By symmetry, A projects to on y. Hence A e C(X, Y). 

We now need to check that A is one-sided. We first check condition i) of Definition 
3.1.1. For convenience, let us represent X and Y as one-sided shift spaces on [0,1]"^ (i.e. 
X = Y = [0,1]^ , and T = 5" is the shift map). Given integers < i < j, and a point 
z e [0,1]^*, we let 

= {y e [0,1]^ \yt = zt,i <t < j} 



and 



z[i, oo) = {y e [0,1]^ \yt = Zut> i}. 



By symmetry, it clearly suffices to prove that for each integer t > and for X-a.a {x', y') 
in X X r, 

E-^{x'[Q,t- l]|x'[i,oo) X y'[i,oo)) = E^{x'[Q,t- l]|x'[i, oo)). (1) 

We prove (1) by considering three cases: t = m, t < m and t > m: 
Case I: t = m 

Using the definition of A, if x = T"^x' and y = S'^y', we have 

Ex{x'[Q,m- l]\x'[m, oo) x y'[m, oo)) = E-x{x'[Q,m- IjlT'^^x x S'^'y) 

= ^™^(x'[0,m-l]) 
= Ei^{x'[0, m — l]\x'[m, oo)). 

Case II: t < m 

Observe that for A-a.a. {x', y'), if a; = T^x' and y = S'^y', then A^y{x\ y') > 0. For each 
such {x',y'), we have 

A2y{x'[0, m - 1] X y'[0, m - 1]) = -B^(x'[0, m - 1] \x'[m, oo)). 

We may then use Case I) to show that 

E-(x'\0 t - l]\x'\t oo) X 'It oo)) = -^^(^'[0'^- 1] ^ y'[t,m-l]\x'[m,oo) x y'[m, oo)) 
x[x[, l\x[,oo) y[,oo)) E-x{x'[t,m - 1] X y'[t,m - l]\x'[m, oo) xy'[m, oo)) 

_ A^Jx'[0,m-l]xy%m-l]) 

A^Jx'[t,m-l]xy'[t,m-l]) 

E^{x'[0, m — l]\x'[m, oo)) 

E^{x'[t, m — l]|x'[m, oo)) 
= ^^(x'[0,t-l]|x'[t,oo)). 

Case III: t > m 

This follows from a direct calculation using Case I) and the fact that A is one-sided. 
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We now prove condition ii) of Definition 3.1.1. Recall the Bernoulli shift -B+(p) = (/(p) = 
{1, r}"^*, m, a) as defined in §3.1. Consider the cylinder set 

C — {z E I{p) I ^0 = ''"O) • • • ) ^m-l = '>^m-l}j 

where e {1, . . . , r}. By the definition of the maps -^x and ■0y, we have a set Vc of nodes 
of length m whose total weights is m(C) such that 

ip-\C) = U T^X and Vy'(C) = U S^Y. 

vEVc v&Vc 

Hence, 

<.(V'x (C) X V'v'(C)) = E Ax,y{v,u) = m{C) 

v.uEVc 

Condition ii) now follows immediately from the definition of A and the one-sidedness of A. 
■ 

We are now ready to prove Proposition 3.3.2. 
Proof (of Proposition 3.3.2): Consider the partition gr^^ (g) /i™^ on X x y. Let A e 

C+(X, Y), construct A as in Lemma 3.3.3. For each atom /3 e g^™"" (8) /i""" and a pair of nodes 
{v, u) of length m, note that 

~X{{T,xS^)(3) = A0{v,u)X{(3). (1) 
Choose a point {x^, y^) e P for each atom /3. Note that by (1) and our choice of A^, 

^ m — 1 « 

— XI / \9m{T'x) - h^{S'y)\ dX 

f 1 '""^ 

= E / -Y,\9m{T'x)-h^{S'y)\dX 



|^| = |„|=m ''^''^•^"(^) " i=0 



m—1 

= 5Z -5^|(?„(TXx^)-/i„(5^5„y^)|A^(^;,H)A(/?) 



m 

|v| = |u|=m i=0 



Thus, 



m—1 



^ III — ± /J fit — J. n 

- 5^ / \gm{T'x) - hUS'y)\ dX = -J2 E / 1^^^'^) - ^^^^y^\ 



J iUrr,r;;-)dX. 
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Since \g - Qm] < l/2™+\ we have 

^ E / \9(T'x) - h{S^y)\ dX< j Ur!, <) dX + l/2"^-\ 
As this argument holds true for all A e (^"""(X, Y), the result follows. ■ 

3.4. A Joinings Proof of Theorem 2.4.1 

In this section, we prove Theorem 2.4.1 using the machinery of one-sided joinings devel- 
oped in §3.1 to §3.3. In fact, wc will prove a stronger result. We say that A G J+(X, Y) is 
an isomorphic joining if B = C. Note that each isomorphic joining is the graphical joining 
of an isomorphism from X to Y. We now prove 

Theorem 3.4.1. Let p be a probability vector. If X = {X,B,ijl,T) e End{p) is tvwB, the 
set of isomorphic joinings is a dense w*-Gs of J"'"(X, 5+(p)). 

Proof: Let Y = -B^(p). Choose a pair of generating functions / : X — > [0, 1) and g -. Y ^ 
[0, 1). Fix iV e N and 6 > 0. 

Let A e J+(X, Y). For each A^' e N and 5' > 0, by the copying lemma, we have a 
function / on y such that 

\dist{f^y)-distCf^y)\<6' (1) 

and 

\dtst,{f^'+ ® g^:+) - dist{f^'+ V ^^:+)| < 5'. (2) 
For £ > 0, if 1/A^' and 5' are sufficiently small then by (1) and the proof of Proposition 2.4.6, 

i((X,/),(Y, /))<£. 

By Proposition 3.3.1, we have some A' e J"'"(X, Y) such that 

\f{x)-]{y)\ dX'ix,y)<e. (3) 



/ 



Then provided that £, 1/A^', S' are small enough, we may use (2) to conclude that 

\dist,{f^+ g^+) - disty{f^+ (g) g^+)\ < S. (4) 

s 

As in [H,R], for A G J^(X., Y), we say that f gY if there exists some function honY such 
that J\f{x)-h{y)\ dX<e. Let = {A G J+(X,Y) | /cF}. Then O, is w*-open in 
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J+(X, Y), and (3) and (4) imply that O, is w^-dense in J+(X, Y). Define / C y if / C y for 

£ A 

all £ > 0. Note that f C.Y implies that BdC. 

OO 

Now, by Baire's Theorem, n is a dense w*-Gs in J+(X, Y). By symmetry, if 

e oo 

0'^ = {\ e J+{X,Y),g<zX}, we see that the set n 0[,^ is also dense in J+(X, Y). Hence, 

A n=l 

oo 

the intersection fi Oi/„ nOi/„ is a dense w*-Gs in J (X, Y) so that it is a residual set. This 

n—l 



concludes the proof as any A in the intersection satisfies f CY and g CX so that A is an 
isomorphic joining. ■ 

This conchidcs the proof of the isomorphism theorem, using the machinery of one-sided 
joinings. Note that by Proposition 3.1.3, the set of isomorphic joinings is non-empty. Thus, 
X and -B+(p) are isomorphic. 

One corollary of the joinings proof is that unless the components of p are pairwise 
distinct, there are uncountably many automorphisms of the one-sided Bernoulli shift S+(p) 
(and hence in general between any two tvwB p-endomorphisms) . To prove this, we first 
need the lemma below, which is just a special case of Proposition 3.1.3. In the following, we 
suppose -B^(p) is represented as the one-sided shift space ({1, . . . , s}^* , n, T) for the Bernoulli 
measure ii = {pi, . . -Ps}^' and the shift map T (so that j has weight Pj for 1 < j < s). 

Lemma 3.4.2. Suppose h : {l,...,s}^' {l,...,s} is tree-adapted and for a.a.x in 

{J, . . . , s}^' , Xq and h(x) have the same weight. Then the h-nanie map 9 : i?^(p) B^(p) 
defined by 9(x) = {h{x), h{Tx), . . .) is a tree-adapted factor map. Moreover, the graphical 
self-joining Xg of B~^{p) derived from 9 is one-sided. ■ 

Proposition 3.4.3. For a probability vector p with at least two identical components, there 
are uncountably many automorphisms of B^{p). 

Proof: Without loss of generality, suppose pi =p2- It suffices to show that J+(S+(p), -B+(p)) 
has no isolated points. Indeed, from the proof of the isomorphism theorem, the isomorphic 
joinings is a dense w*-Gs in J+(i?+(p), i?+(p)) . However, by the Baire Category Theorem, 
a dense w*-Gs in a complete metric space with no isolated points is necessarily uncountable. 

To this end, notice that there are at least two elements in J+(i3+(p), i3+(p)). Indeed, the 
graphical joining derived from the identity automorphism certainly is one. Another one is the 
graphical joining derived from the /i-name map for the function h : {1, . . . , s}^' — > {1, . . . , s} 
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defined by 

[2, ifa;o = l; 
h{x) = N 1, a Xq = 2; 

[ Xq, otherwise. 

(This is one-sided by Lemma 3.4.2). 

Let A e J+(S+(p),S+(p)) and £ > 0. It suffices to construct A' e J+(S+(p), S+(p)) 
such that X' ^ X and A'~A in w*. To see this, since there are at least two elements in 
J+(-B+(p), -B+(p)), just choose some one-sided joining Ai distinct from A. By Proposition 
3.1.2, convex combinations of A and Ai remain one-sided. Let A^ = (1 — 6)X + 6Xi. If 6 is 
small enough, then A^ will be sufficiently close to A in w*, and we are done. ■ 
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Chapter 4: 

Examples of TvwB p-Endomorphisms 



In this chapter, we will present two classes of examples of tvwB p-endomorphisms. It 
follows from the isomorphism theorem, Theorem 2.4.1, that all of these are isomorphic to 

4.1. One-Sided Markov Shifts 

Besides the one-sided Bernoulli shift S+(p), the simplest examples of p-endomorphisms 
can be found among the one-sided Meirkov shifts, which we will define below. 

It will be convenient for us to define a one-sided Markov shifts over left-infinite shift 
spaces of the form for a finite set C. We say that a square matrix A is stochastic if 
each entry is nonnegative and the sum of the entries in each row equals one. Let \A\ denote 
the number of rows (or columns) of A. We may obviously index the rows and columns of 
A by the integers from 1 to \A\ and denote the entries in A by A^, where 1< i,j < \A\. A 
stochastic matrix is irreducible if for each pair there exists A; G N such that {A^)^^ 

is nonzero. It is well known that for an irreducible stochastic matrix, there exists a unique 
row probability vector q with all components positive such that qA = q. We say that q is a 
left fixed probability vector of A. Let qj denote the j-th component of q. Using q and 
A, we may define a measure /i on the cylinder sets of {1, . . . , by 
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fJ,{X G {1, . . . , 1^1} I X-n — 0,-n-i ■ ■ ■ 1 X-1 — CL-l) — ?o_„^a_„a_(„_i) ' ' ' ^a_2a_i ■ 

It is easily seen that /i extends to a measure on the Borel sigma- algebra B of {1, . . . ,\A\}~^ . 
If T is the shift map on {1, . . . , l^l}""^, then note that = //T~^ (i.e. is shift invari- 
ant). We define the one-sided Mcirkov shift over A, denoted X^, to be the m.p.s. 
({1, . . . , B, jJL, T). We will refer to the integers {1, . . . , |A|} as the states of X^. 

We say that X^ is an irreducible Meirkov shift over A \l A is irreducible. Note that 
X^ e End{p) if and only if the entries in each row of A are the components of p, after 
deleting all zero entries. 

For each stochastic matrix A, consider the function : {1, . . . ,\A\} 

defined by /^(a^) = x_i. By placing the discrete metric d on {1, . . . , \A\}, it is clear that 
is generating . Note that for the one-sided Markov shift X^, the (—1)** coordinates and the 
conditional probabilities of the inverse images of any point x are completely determined by 
x_i. If X^ e End{p), we can thus choose a tree partition Kx- of X^ with the following 
property: whenever x_i = y_i, then x and y generate the same /^-tree name if the tree 
names are defined with respect to that partition. We will henceforth assume that for each 
X^ e End{p), we choose Kx- with this property. If / is a state of X^, let r/^ be the 
common /^-tree name generated by all points y with = I- 

We now wish to give examples of one-sided Markov shifts that are tvwB. Our first 
example is motivated by the well-known fact (Ornstein and Friedman) that a strongly mixing 
two-sided Markov shift is two-sided Bernoulli. Unfortunately, the one-sided analogue of this 
fact requires considerably more restrictions. Indeed, consider the one-sided Markov shift 
induced by the stochastic matrix: 

2/3 l/3\ 
1/3 2/3/ 

It is not difficult to see that X^ G End{^, |). However, as X^ is not tvwB (since the standard 
generator yields two tree names whose distance is 1 for all m G N), X^ is not isomorphic 
to and hence cannot be one-sided Bernoulli. (Note that, however, the two-sided 

Markov shift over A is isomorphic to the two-sided BernouUi shift |).) Nonetheless, as 
the following shows, for one-sided Markov shifts which are uniformly p-to-1 endomorphisms, 
strong mixing does imply one-sided Bernoulli. 
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Proposition 4.1.1. Let p e N and p = Suppose A is a N x N stochastic matrix 

such that the one-sided Markov shift is in End{p) and is strongly mixing. Then is 
tvwB. 

Some observations will be helpful before we prove Proposition 4.1.1. Note that for every 
stochastic matrix A, we may associate a directed weighted graph to it. Specifically, we define 
^(^4) to be the graph with 1^41 vertices identified by the integers 1, . . . , |A| with a directed 
edge from 7 to J labeled with weight w if Ajj = w ii w > 0. Note that if X^ e End{p), 
the set of weights of the edges extending out from any vertex in G{A) is the same. The 
critical observation which will be of use to us in the proofs of the ensuing propositions is the 
following: for any states I and J, there exists a node v of length n such that Tj^ (v) = J if 
and only if we have a path in G{A) of length n from vertices I to J. 

Proof (Proposition 4.1.1): For brevity, we let / denote throughout the proof. Since 
X^ is strongly mixing, there exists some integer n such that > for all 1 < i,j < N. 

We thus have some path in G{A) of length n between any two vertices in G{A). By the 
observation made just prior to this proof, this implies in particular that there exists some 
state J such that for all state 7, Tj{v) = J for some node v of length n. Hence, for all 
X e X^, r/(f ) = J for some node v of length n. 

For any pair of points {x',y'), let B^i y, be any tree automorphism in An such that 
r^,(f) = Ty, [B ,j.f ,yf [v)) = J for some node v of length n. Given a fixed pair of points (a;, in 
X^, we now build a tree automorphism B n-levcls at a time which makes im{Tl,T^) small 
for all large m. For < \v\ < n, let B{v) = B^ y{v). Inductively, assume that B{v) is defined 
for all \v\ < sn. For sn < \v\ < (s + let v = uv', where \v'\ = sn and < |k| < n. 
We then define B{v) = uB{v') if f{T^'X) = f{TB^y')y) and define B{v) = B^T,,^,T^^,,^y){u)B{v') 
otherwise. 

Now, we have some node u of length n such that /(T„x) = f{TB{u)y)- Since the /-tree 
name of x depends only on a;(— 1) and we are extending by the identity automorphism in the 
subtree rooted at u, it follows that if n < t < 2n, the weights of the nodes \v\ = t such that 

f{T^x) = f{TB{v)y) total to at least 1/p". For the nodes = n such that f{T^x) ^ f{TB(u)y), 
B is defined in such a way that we have some node \v\ = n such that f(T^^x) = f(TB{vu)y)- 
Thus, ii 2n < t < 3n, the weights of the nodes 1^1 = ^ with f{T^x) = f{TB(v)y) sum to at 
least 
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Inductively, we see that in general, if sn < t < {s + l)n, the weights of the nodes |f | = t 
with f{TyX) = J(TB(v)y) sum to at least 



This sum approaches 1 as s — > oo, independent of x and y. Hence it follows that is tree 
v.w.B. ■ 

The next proposition gives an additional class of tvwB Markov shift. 

Proposition 4.1.2. Let p be a probability vector {not necessarily uniform). Suppose that 
A is an irreducible N x N matrix such that e End{p) and for every pair of rows in A, 
we can find two identical nonzero entries in the same column, then is tvwB. 

Proof: Once again, consider the graph G{A) associated to A. Note that it suffices to find 
n e N along with paths of length n from each vertex ending at a common vertex such that the 
corresponding edges in the paths have equal weights. Indeed, this will allow us to conclude 
that for any x and y in X^, there exists a node v of length n such that t^^{v) = r^'^{B{v)) 
for some tree automorphism B, and we may argue as before to reach the conclusion. 

To construct the required paths, note that by assumption, there are edges extending 
from vertices 1 and 2 with the same weight Wi ending at a common vertex, say Ji, in G{A). 
We then choose any edge of weight Wi extending from vertex 3. If this edge ends at vertex 
J2, then again by assumption we have edges of equal weights from Ji and J2 which end at 
a common vertex. This allows us to extend the paths from each of vertices 1, 2 and 3 such 
that they all end at the same vertex and the corresponding edges in the paths have the same 
weight. A simple inductive argument enables us to construct the desired paths. ■ 

We end this section with a proposition which shows that we can decide whether a one- 
sided Markov shift is tvwB simply by checking tree names of a finite height (depending on 
the dimension of the stochastic matrix A). For a general p-endomorphism, deciding whether 
it is tree v.w.B is obviously a more difficult problem. 

Let X^ e End{p) and consider its associated graph G{A). Note that every path in G{A) 
"sees" a sequence of weights by reading the weights attached to the edges from the start to 
the end of the path. We now prove: 




s-l 
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Proposition 4.1.3. Suppose that A is an irreducible N x N matrix such that the Markov 
shift e End{p). Then is tvwB if and only if there exist paths of common length 
< N^^ from each vertex in G(A ) which see a common sequence of weights and end in the 
same vertex. 

Proof: We note that every edge e in the directed graph G{A) can be represented by the 
triple {s{e),t{e),w{e)), where s(e), t{e) and w{e) are the starting vertex, terminal vertex and 
weight of e. Since the number of different nonzero weights cannot exceed the number of 
vertices, we have at most different types of edges under this representation. 

Now, if {X^, Ja) is tvwB then we have paths Vj from each vertex J in G{A) which end at 
a common vertex and see the same sequence of weights. Indeed, by the tvwB condition, for 
any two vertices / and J', we must have paths Vi and Vp with the desired property. If /" is 
another vertex, then construct any path Vjn from I" which see the same sequence of weights 
as Vj. The three paths just constructed end in at most two distinct vertices so we may extend 
these three paths such that they end at a common vertex. An inductive argument gives us 
the required paths. 

Assume that the paths chosen have a common length > N^'^ . For a path u, let u{j) 
be the j-th edge of u. Then there are at most N^^ possible ordered A'"-tuple of edges 
(f i(j), . . . , f Ar(j)) for each j. Now, if the paths Vi, . . . , Vjv have more than N^^ edges, then 
there exist integers j < j' such that 

{Vl{j), VnU)) = (Viij'), VnU'))- 

Hence, we may shorten the path Vj by deleting the edges Vj{k) for j < k < j', for each vertex 
I. Clearly, the new paths still have the desired property but they have a shorter length. 
Hence, we may continue to shorten the paths to have a common length < iV^^. 

Conversely, if the asserted property holds, then for every pair of states / and J of X^, 
there is a node u of length < N^^ such that there exists a tree automorphism B with 
Tj^{u) = Tj^{Bu). The same argument in the proof of Proposition 4.1.1 shows that X^ is 
tvwB. ■ 

Since the tree names r/'* restricted to Tj^aw over all states / of X^ determine all paths of 
length at most N^'^ in G{A), Proposition 4.1.4 shows that it suffices to look at tree names of 
that height to determine whether X^ is tvwB. It is worth mentioning that Ashley, Marcus 
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and Tuncel [A,M,T] developed a general, though necessarily more comphcated, algorithm 
for deciding whether any two one-sided Markov shifts are isomorphic. 

4.2. A Generalization of [T,Id] 

In this section, we shall consider a case of the well-known [T, Id] transformations in 
the context of a general probability vector p = (pi, . . . and characterize those that are 
one-sided Bernoulli. 

The [T, Id] transformation can be described as follows. We consider the 2-shift -B+(p) = 
({0,1}^ , fj,, a) with p = (i, |), and a Lebesgue space {Y, C, v). Suppose T is an automorphism 
of y, define the map [T, Id] on the product space {0,1}'^ x Y with product measure /i x i/ 

by 

[T,Id]{x,g) = {ax,T^^'^g). 

It is not difficult to show that [T, Id] is measure preserving and the resulting m.p.s. ({0, 1} x 
Y^jixv, [T, Id]) is a p-endomorphism. 

A special case of [T, Id] occurs when Y is just the circle represented as [0,1) and T = i?„ is 
a rotation on Y by an irrational a. Hoffman and Rudolph [H,R] showed that this particular 
[T, Id] system (along with other isometric extensions of the uniformly p-to-1 endomorphisms 
with certain properties) are all tvwB and hence one-sided Bernoulli. 

Let us now extend the [T, Id] system to the situation when p is an arbitrary finite prob- 
ability vector. Consider a compact abelian metrizable group G with a translation invariant 
metric d' and Haar measure v (i.e. d'{hg,h'g) = d'{h,h') for all h, h' and g e G). Let d be 
the discrete metric on {1, . . . , n}. Define a metric D on the set R = {1, . . . , n} x G by 

D{{xi, gi), {X2, g2)) = -d{x^, x^) + -d {gx^g^)- 

Let p = (pi, . . . and let S = {1, . . . , n}^' . As usual, let -B"'"(p) = [B, /i, a) be the one- 
sided Bernoulli shift such that state j has weight Pj, for 1 < j < n. For each state j, we 
associate to it some element gj in G. Consider the transformation S on B x G defined 
by S{x,g) = {ax,ggj) if Xq = j. Let A be the product measure /i x i/, then note that 
{B X G, A, S) e End{p). 

Consider the function f : B x G ^ R defined by f{x,g) = {xo,g). Note that / is 
generating. Let /i and /2 denote the component functions of /, i.e. fi{x,g) = Xq and 
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f2{x, g) = g. Clearly /i : S x G — > {1, . . . , n} defines a tree partition of {B x G, A, S). We 
may thus use /i to define a set of partial inverses Sy for each node v of the p-tree T as 
described in Chapter 1. Clearly, we have fi{Sy{x, g)) = fi{Sy{x\ g')) for all nodes v of length 
>1 and for all (x, g) and {x\ g') in B x G. 

Proposition 4.2.1. {B x G, A, S) is tvwB if there exist i ^ j with pi = pj such that gjg~^ 
has dense orbit in G. 

Proof: We first assume the stated condition and prove that {{B x G, A, S), f) is tvwB. With 
no loss of generality, we may assume that pi = p2 and ^as dense orbit. Given e > 0, 

we note that there exists some s G N such that for any h and h' in G, we can find an integer 
< r < s such that d'{hg^'', h'g^'') < £■ To see this, partition G into sets of diameter < e/3. 
On each set, pick an arbitrary element in it. Say that yi, . . . ,yk are the elements picked. 
Since g2gi^ has dense orbits and G is abelian, for any pair (y^, yj) in G, there exists an integer 
r > such that d'{yig:['^ ^yjg^'') < s/3. We may then choose s to be larger than all these 
r's. Thus, for any two elements h and h' in G, we have some integer < r < s such that 
h'g:^-) < e. 

For any pair of points x and y in S x G, by the above paragraph, there exists some tree 
automorphism A and some node v of length r < s such that 

d'{f2{SyX),f2{SAvy))<e. 

(In fact, the inequality can be met by considering the node v = {1, . . . ,1) oi length r and 
letting A be any tree automorphism such that Av = {2, . . . , 2).) Moreover, for any node v', 
we have 

d'{f2iSv'vX), f2{Sy,Avy)) < £■ 

If v' is not the root node, then as we have observed, 

d{fi{Sy,yX)Ji{Sy,Avy)) = 0. 

Hence, we have some node u of length s and a tree automorphism A such that 

D{f{S^x), fiSAuV)) < e. 

If Pn is the smallest component in the probability vector p, we can imitate the proof of 
Proposition 4.1.1 to construct the required tree automorphism A s- levels at a time such that 
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whenever sN <t< s{N + 1), the nodes v of length t for which D{f{SyX), fiSAvV)) < £ have 
total weights at least 

N-l 

j](p„r(i-(p„)7- 

i=0 

Prom this, we see that {B x G, A, 5") is tvwB. ■ 

In the case when G is the circle [0,1) with Lebesgue measure, we can improve Proposition 
4.2.1 as follows: 

Proposition 4.2.2. {B x [0, J), A, S) is tvwB if and only if there exist i ^ j with pi = pj 
such that gjQi^ is irrational. 

Proof: Since the irrationals have dense orbits, the fact that {B x [0,1), A, S) is tvwB given 
the stated condition is just a special case of Proposition 4.2.1. Conversely, suppose the stated 
condition is false but {B x [0,1), /) is tvwB. We then have some positive integer N > 2 such 
that QjQ^^ is some integral multiple of whenever pj = p^. Suppose x and y are points in 
B X [0,1) with the property that f2{x) = t and f2{y) =t + for some t e [0,1). Then by our 
choice of the rotation factors gj, for every tree automorphism A and node v, if f2{SyX) = t', 
then f^iSAvV) = ^' + ^ + some integer k. Hence, d'{f2{S^x), f2{SAvy)) is bounded away 
from zero and so there exists some /3 > such that for all n, 

for all pair of points {x,y) such that f2{x) = t and f2iy) = t + ^ for some t G [0,1). Now, 
as any large set must contain a pair of such points, this contradicts the tvwB condition and 
completes the proof. ■ 
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Chapter 5: 

Finite Group Extensions of One-Sided 
Bernoulli Shifts 

In this chapter, we consider Lebesgue spaces with certain semigroup actions defined on 
them. The main result of this chapter is that for any two such spaces with semigroup actions 
{Tg}gQG and {Sg}g^G, thcrc is an isomorphism such that (j) o Tg = Sg o (p ior all g & G. 

5.1. An Isomorphism Theorem on TvwB G-extensions 

Let G' be a metrizable semigroup. For an arbitrary nonatomic Lebesgue space {X, B, /i), 
we define a G"-action on {X, B, ji) as a collection of measure preserving endomorphisms {Tg} 
with the property that TgTgi = Tgg> for all g, g' E G', and the map n : G' xX ^ X defined by 
Tr{g, x) = TgX is measurable. If (Y, C, v) is another nonatomic Lebesgue space with a G'-action 
{Sg} defined on it, then we say that ((X, B, /i),{Tg}) and ((F, C, z/), {Sg}) are G"- isomorphic 
if there exists a measure preserving bijection (p : X such that (j)Tg = Sg(f) for all g G G'. 

Consider a finite group G. Note that N* x G is a semigroup with operation defined by 
{n, g) ■ {n', g') = {n + n', g'g). For a probability vector p = (pi, . . . ^p^), we say that a N* x G- 
action {T(„^g)} on a nonatomic Lebesgue space {X, B, ji) is a (N* x G, p)-action if the m.p.s. 
(X, B, II, T(i_e)) is a p-endomorphism and T^o,g) is an automorphism of {X, B, /i) with no fixed 
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points unless g = e. The object of this chapter is to prove the following theorem. 

Theorem 5.1.1. Let {X, B, fi) and {Y, C, v) he two nonatomic Lebesgue spaces with (N* x 
G,p)-actions {T(„ g)} and {5'(„.g)} defined on them. If the systems X = {X, B, fi,T(^i,e)) and 
Y = {Y, C, z/, 5(i,e)) are both tvwB, then ((X, B, /x), {T(„,g)}) and ((F, C, ly), are N* x G- 

isomorphic. 

A natural example of a space with a (N* x G, p)-action is given by a G-extension of 
B~^{p), which we will now define. Let v be the uniform probability measure on the finite 
group G equipped with the discrete topology. Let / denote the set {1, . . . , s}. As we have 
seen, by assigning weight pj to j, we may represent the one-sided Bernoulli shift -B^(p) as 
{I^' , B, n,T), where T is the shift map and fi is the product measure {pi, . . . ,Ps}^' on the 
Borel sigma-algebra B. Let P : I^' I he the time zero partition defined by P{x) = Xq- 

Consider the product space x G with the product measure /ixv on its product sigma- 
algebra V. For a measurable partition (p : I^' — > G, define the map T'*' : I^' x G — > I^' x G 
by T*'(x, g) = {Tx, <p{x)g). We will refer to such partitions </? as a cocycle. It is easy to verify 
that T'^ is measure preserving. We define the (G, </?)-extension of -B+(p), denoted as S+(p)'^, 
to be the m.p.s. (/^* x G,V,p x v.T'f). More generally, we say that a m.p.s. is a G-extension 
of i?^(p) if it is the (G, (/9)-extension of -B^(p) for some cocycle ip : I^* G. It is clear that 
any G-extension of -B^(p) is a p-endomorphism. Define the partition P' : J^* x G / x G 
by P'{x, g) = (P(x), g). Let d be the discrete metric on / x G. Clearly, (/ x G, d) is a compact 
metric space and P' is generating. 

Given a G-extension S+(p)'^, note that each element {n,g) e x G corresponds to a 
measure preserving map obtained by composing the maps P(i,e) and P(o,g') given by 

V{i,e){x,g) = {Tx,p{x)g) 

and 

Vio,g'){x,g) = {x,gg'). 

It is easy to verify that the maps P(n.g) define a semigroup action, namely a N* x G- 
action, on I^* x G. (This is the reason why we defined (n, g) ■ (n', g') as (n + n', g'g) instead of 
{n + n',gg').) We say that the G-extensions S+(p)'^ and i3+(p)''* are N* x G-isomorphic if 
there is a measure preserving bijection p -. I^* x G — > I^* x G such that pp{n,g) = (t>{n,g)P 
for all {n,g) e N* x G. We say that p is a N* x G-isomorphism from B+{pY to S+(p)'^. 
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As poT'f = T"^op, the N* x G-isomorphism p is also an isomorphism from S+(p)'^ to S+(p)'^ 
in the sense of our original definition in Chapter 1. 

We begin the proof of Theorem 5.1.1 with some reductions. First, note that if (X, //) 
has a (N* x G, p)-action on it such that X = {X, B, ji, T(i_e)) is tvwB then we can represent it 
as a G-extension of -B"'"(p). In fact, consider the space X/G of G-orbits (so each point in X/G 
is a set of the form {T(o,g)x}ggG)- ^(i,e) projects to a measure preserving transformation T on 
X/G such that {X/G,T) is a p-endomorphism. Since T(o^g) has no fixed points unless g = e, 
the canonical projection is He : X ^ X/G is a tree-adapted factor map. Thus {X/G,T) is 
isomorphic to -B+(p) by Theorem 2.4.1 and Proposition 2.4.2. Representing the space X as 
X/G X G, it is immediate that {X, {T^n,g)}) is N* x G- isomorphic to {I^' x G, {(p(n,g)}) for 
some cocycle : I^' — > G. Thus we may rephrase Theorem 5.1.1 as follows. 

Theorem 5.1.1'. If i?+(p)'^ and i?+(p)'^ are tvwB, then they are N* x G-isomorphic. 

Note that by our definition, all G-extensions of -B+(p) are defined on the space I^' x G, 
although different G-extensions have different dynamics. To emphasize the difference in 
dynamics, for any cocycle ip : — > G, we will henceforth use to denote the product 
space X G of the m.p.s. S+(p)''' and we will denote the generating partition P' of by 

Let n : I^' X G — I^' be the canonical projection. For x G I^' and a subset A C 
let the fiber over x denote the set U^^{x) and the fiber over A denote the set Il^^{A). 
Note that for any N* x G-isomorphism p : ^ I^, since p<^(o,g) = <P{Q.g)P for all g & G, p 
maps fibres to fibres, i.e. for x e I^' , p(n~^(x)) = ^~^{y) for some y G /^*. Moreover, it is 
easily seen that the fibres are mapped to one another by a group rotation in the sense that 
for each x G /^*, there is g E G and x' G such that p{x, g') = {x' , gg') for all g' G G. 

Conversely, given measurable partitions </? : I^' — > G and : Z"^* — > G, an isomorphism 
Pi : -B+(p) — > -B+(p), and a measurable partition 9 : I^' — > G such that 

= (f>{p,ix))0{x), 

then it is direct that the map p : defined by p{x,g) = (pi(x).9{x)g) defines a 

N* X G-isomorphism from B^{pY to B^ipy. Thus, proving that B'^{py and -B+(p)'^ are 
N* X G-isomorphic amounts to constructing the maps pi and 9. 

The basic approach to the proof of Theorem 5.1.1' is the same as that of Theorem 2.4.1, 
with some necessary additions. We will focus primarily on these additions and refer the 
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reader to the various propositions in Chapter 2 when the proofs are exactly the same. 

Before we embark on the proof, some remarks on notations are in order. Rather than 
considering functions taking values in the compact metric space [0,1] as in Chapter 2, we 
will assume in this chapter that all the functions tcike values in the finite set 
I X G (i.e. are partitions), with the discrete metric d, unless otherwise specified. To 
emphasize this fact, we will denote the functions by capital letters such as Q and R instead 
of small letters g and h. Among the functions which range in / x G, we shall primarily be 
interested in those functions Q : I^' x G ^ I x G for which there exists a pair of measurable 
partitions Qi : I^' I and ^ : I^' G such that 

Q{x,g) = {Qi{x),^{x)g) 

for all {x, g) G I^' x G and Qi is tree-adapted. We say that Q is a G-map if it has the 
above property. We can thus think of each G-map Q as being defined by an ordered pair 
of functions of the above form. We will write Q = (Qi,0 if Q is defined by 

Note that the generating partition P' is a G-map. If Q is a G-map, then for each x e Q 
assigns the same first coordinate to any two distinct points in the fiber over x but distinct 
group coordinates. Thus, a G-map takes every fiber onto a set of the form {j} x G for some 
3^1- 

For the purpose of constructing tree names, it will be convenient to choose a "canonical" 
set of partial inverses for G-extensions as follows. Recall that each node v of the p-ary tree 
T is a finite sequence of integers in / = {1, . . . s}. Consider any G-extension i?+(p)'^. For 
each z G /^*, let vz be the point in /^* obtained by concatenating v to the left of z. If 
\v\ = t, then for each ix,g) in I^, we define the partial inverse by setting T^{x,g) to be 
the unique element in {T'^)~*{x, g) whose first coordinate is vx. 

Given a function Q on S+(p)'^, we may then define the partitions Q^'' : ^ {I x G)^^, 
:I^^{Ix Gf^ and : ^ {I x Gf exactly as before. Moreover, we may define 
the t distance between any two processes {B~^ {p)'*' , Q) and {B~^{p)'^,R) for functions Q and 
it! on the respective spaces as described in Chapter 2. Proposition 2.4.6 translates into the 
following. 

Proposition 5.1.2. Let S+(p)'^ be tvwB. Suppose Q is a function on I^. For all e > 0, 
there exist S and N with the following property: for any G-extension B~^{p)'^ and function 
R on I^, if i?^^ ~ g^^, then t{{B+{pr, Q), (S+(p)^ R)) < e. U 
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Proposition 5.1.3. Consider G-extensions S+(p)*' and S+(p)"'^, along with G-maps Q on 
and R on I^. Then for all g E G and all v e T', 

d{Q{T:{x, g)), R{Tt{x\ g))) = d{Q{T:{x, e)), R{T^^{x', e))). 

Proof: This follows immediately from the fact that as Q is a G-map, if Q{T^{x, e)) = (j, g'), 
then Q{T^{x, g)) = (J, g'g), and hkewise for the G-map R. ■ 

We are now ready to prove the analogue of the perturbation lemma, Proposition 2.4.7. 
For {j,g) e I xG and g & G, set (j, g)-g = (j, gg). For any G-extension S+(p)'^, a measurable 
set B in and g E G, let Bg denote the set B fl {I^' x {g}) (so Bg is the subset of B with 
group coordinate g). In the following proposition and its proof, we will use the notation 
Oj(5), i = 1, 2, ... to denote real- valued functions of S such that lims^oOi{6) = 0. 

Proposition 5.1.4. Consider an crgodic G-cxtcnsion i?+(p)'^. Then there exists some Oi{6) 
with the following property: For any ergodic G-extension B^{y>)'^ and any G-map Q on I^, 
if i((S+(p)'^, P'^), (-B+(p)*, Q)) < 6, then for all e and N, there exists a G-map Q' on such 
that \Q - Q'\ < Oi(5) and {P^f^ ~ Q'^^. 

Proof: We proceed along the hues of Proposition 2.4.7 except that we cannot use only a 
single tree name to define a function on the chosen tree Rokhlin tower as we did in Chapter 
2. For ?7 > to be specified later, choose M e N such that 

a) N/M<ri/2 

b) there exists a set 5* C I'^' with n{S) > 1 — rj such that whenever x & S, {x, g) is 
77,(M - A^)-generic for {Pv')^'^ for all g- G G 

c) JiM{rZ,r^,h)d{i^ x u){x,g)d{n x iy){y,h) < 5. 

From b) and c), if 77 is sufficiently small, we have some x E S such that there exists a 
set S' C I^' with pL{S') > 1 - 02(6) and tM(ri^I, r^^e) < 02{5) whenever z E S'. Using the 
Strong Tree Rokhlin Lemma, build a 77/2-tree Rokhlin tower M of height M + 1 in S+(p) 
such that at least 1 — 02(5) fraction of the base B is in S'. Note that M' = n"^(M) is also a 
tree Rokhlin tower in i?+(p)'* with base B' = 11^^(5). For each (y, e) G B'^, we define Q' by 
laying the tree name r^^ on {{T'^y^{y,e) \ 1 <i < M} via a suitable tree automorphism A 
which optimizes tuiTxl ^'^ye) ^"^^ i'^ Proposition 2.4.7. This defines Q' on the column 

over B'^. We then extend Q' to points in the column over B'^ for each 5^ e G by defining 
Q'{T^{y,g)) = Q'{Tf{y,e)) ■ g. Note that restricted to UM'\S', Q' is a G-map. By our 
construction, UM'\S' is a union of complete fibers, i.e. UM'\S' = U~'^{E) for some set E 
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in . We may thus extend Q' to the rest of the space such that it remains a G-map by 
defining Q'{x,g) = {P{x),g) on UM'\S'. 

By Proposition 5.1.3 and our definition of Q\ whenever x' E S' f] B, we have for each 

g^G, 

^ J2 wJ{Q{T:^{x',g)\Q\T^{x\g))) = ^ ^ wJ{Q{T:^{x' ,e)\Q'{Tt{x' ,e))) 

0<|j;|<M 0<|t>|<M 

_ J ( Q ^ 

~ ^M\.i x,e ■> ' x',e) 

< 0,{6) 

Thus, since Hb{S') > 1 — 02(5), we have 

J diQ{z, g), Q'{z, g)) dii{z)dv{g) < 0^{5). 

UM'\B' 

Since ^ x z/(UM'\i?') > 1 — 5 for sufficiently small we have \Q ~ Q'\ < Oi{5). 

To prove that (P'fi)^'^ ~ Q'^^ , note that by our construction of Q', we have Q'^''^{x', g) = 
(P'^)^^(a;, g) for all {x', g) e B' . By a) and b), since x E S, the same reasoning in Proposition 
2.4.7 shows that the required tree distributions differ by less than e. ■ 

The following is an immediate consequence of the proof of Proposition 5.1.4. 

Proposition 5.1.5. Consider ergodic G-extensions B^{p)'^ and B~^{p)'^. For every e and 
N, there exists a G-map Q on such that {P'Pf^ ~ Q^^ . m 

The next proposition is the Strong Sinai's Theorem which is formally identical to Propo- 
sition 2.4.9. 

Proposition 5.1.6 (Strong Sinai's Theorem). Suppose B~^{p)'^ is tvwB. Given e > 0, 
there exist S and N with the following property: If S+(p)* is an ergodic G-extension, Q is 
a G-map on S+(p)^ with [P'^f^ ~ Q^^ , then there exists a G-map Q' on S+(p)'^ such that 
dist{B+ ip)''', Q') = dist{B+{pY, P^) and \Q - Q'\ < e. 

Proof: Use the approach in Proposition 2.4.9 to choose 6 and 1/N small enough such that 
wc have a Cauchy sequence of G-maps on converging to some map Q' such that 
\Q -Q'\ <e with dist{B+{p)^, Q') = dist{B+{pY, P^). It is easily checked that Q' is also a 
G-map. ■ 

Given the G-map Q = {Q^, Q such that dist{B+ (p)''' , Q) = dist{B+{pY, P^), we can pro- 
ceed as in Chapter 2 to construct a tree-adapted factor map p : B~^{pY — > -B+(p)'^ defined 
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by p{x,g) = {Q^' {x)X{Q^' {x))g)- Moreover, for any G-extension B~^{p)'^, the canonical pro- 
jection n : 7^ — > I^' is clearly a factor map from S+(p)'^ to -B+(p). Combining Propositions 
5.1.5 and 5.1.6, along with the fact that the partition Q' constructed in Proposition 5.1.6 is 
a G-map, the following corollary is immediate. 

Corollary 5.1.7 (Sinai's Theorem). Suppose i?+(p)'^ is tvwB and i?+(p)'^ is ergodic. 
Then there exists a G-map Q on S+(p)'^ such that dist{B+ (p)'^ , Q) = dist{B+{pY .P'^). 
Moreover, the factor map p S+(p)'^ — > S+(p)'^ as constructed above projects to a factor 
map Pn ■■ B~^{p) — B~^{P) such that the following diagram commutes: 

B+{p)'i> A S+(p)^ 
ni in 

B+{p) ^ B+{p). 

Pn 

■ 

Our next goal is the copying lemma. Recall that for B C I^' x G and g & G, we defined 
the set Bg to be the subset of B with group coordinate g. For g E G and a set 5" C I^' x G, 
let 

S-g = {{x,g'g)eI''' xG\{x,g')eS}. 

Proposition 5.1.8 (Copying Lemma). For ergodic G-extensions i?+(p)'^ and B^{p)'^, let 
Q be a G-map on such that dist{B+ {pf , Q) = dist{B+{p)^, P'^). Then for all e and N, 
we have a G-map Q' on such that 

distiQ' V P'^f^ ~ dtstiP'^ V Q f^. 

Proof: Let M be chosen such that N/M < e/2. Define a partition (p : I^' I x G by 
(p{x) = {P{x), ip{x)). Construct a s/2-tree Rokhhn tower M of height M + 1 in -B+(p) with 
base B such that dist{(p^^) = dist{(p'^^\B). Using the canonical projection n : 7^ — > 7"^*, 
we may lift the partition (p'^^ and the tree Rokhlin tower M so that we may regard (p^^ 
as being a partition of 7^ and the tree Rokhlin tower M with base B as being in B^{pY . 
Clearly, wc still have dist{if>'^^^) = dist{(p^^\B). For any atom a G (p^^ and g ^ G, note 
that all points in have the same P^-M-tree name. Thus, dist{{P'^ f^'^) = dist{{P'^Y'^\B). 

Consider the tree-adapted factor map p : B~^{pY B~^{pY as constructed in Sinai's 
Theorem. For any atom a e (p^^ , note that if (x, g) e a, then (P^)*^^(x, g) = Q^^{y, g') for 
any {y,g') e p-\x,g) by Proposition 2.1.4. Thus (P^)^^(Q;g) = Q^^{p-\ag)). 
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For each atom a e ^'^^ ^ construct a partition of cce fl S such that 

dist{Ul) = distiiP'^ V Qf'^\p-\a,)) (1) 

We then define the required partition Q' on the column of the tower over as follows. Fix 
an atom a e By (1), we can define a bijective correspondence A of the atoms of 

and those of {P'^'y Q)^^ with the same conditional measures. Consider an atom (3 E U^. As 
we have already noted, (P^)^^(x,e) = Q^^{x\g') for any (x, e) e P and {x',g') e A(/3). We 
may then define Q' on the tower over /3 by laying the P"'^-M-tree name of some point {x\ g') 
in A{(3) via suitable tree automorphisms such that {Q' V P^)^^(x, e) = {P'*' V Q)^^ix', g') for 
any (x, e) G Repeating for each P E and then for each a G <^*^^, we can then define 
Q' on the tower over B^. 

We now extend Q' on the column of the tower over Bg for all g E G. To do this, 
note that if {x,g') is in the column over B^, then {x,g'g) is in the column over Bg. Define 
Q'{x, g'g) = Q'{x, g') ■ g. This defines Q' on UM\P. It is clear that Q' is a C-map restricted 
to UM\S. Since UM\S is B-measurable, we may then extend Q' so that it is a G-map on 
the full space by setting Q'{x, g) = {P{x), g). 

We now prove that dist{Q' V P^)^^ ~ dist{P'>' V Q)^^. If «; e (7 x G)^^ and geG, let 
K- g E (/ X G)^^'^ be defined by (k ■ g){v) = k{v) ■ g. If k G (/ x G)^^ and k is a representative 
in K,, define fi ■ g E {I x G)*^^ to be the equivalence class containing k, • g. For any atom 
a E (p'^^ and /3„ G U^, note that if {x, e) E (3a and (a;', g') E HPa)i then for each g E G, 

{Q' V P^r^'ix, g) = HQ' V Pn''''{x, e)) ■ g 

= ((P^vg)^^(x',^'))-5 
= (P^vg)^^(x',y'^). 

This implies that /3„ • g and A(/3„) • g have the same image under {Q' V pv)^^ ^p4> y q^mv 
respectively. Moreover, since Q is a G-map, p~^{ag) = p'^ia^,) ■ g and so 

p, X i^(A(/5„) • g n p~\ag)) ^ px z/(A(/5„) ■ g\p~\ag))p x u{p'\ag)) 

= px v{K{p^)\p-\a^))p X u{p-\a^)) 

= px iy{A{Pa)\p~\ae))p X iy{ae) 

= px i'{(3a\a^ n B)p X v{a^\B) 

= px iy{/3^\B) = px iy{/3a ■ g\B) 
As this holds for all gf G G, a G <^^^, and E U^, we have 

dist{P'^ V Q)^^ = dist{{Q' V P'^)^^|P) 
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Hence, by the same reasoning as in the proof of Proposition 2.4.12, we see that the distribu- 
tion of {Q' V p¥')Jvv gg^p]^ igygi Qf ^jjg ^Q^gj. except the top N levels equals dist{P'>' V Q)^^ . 
Since A^/M < s/2, the result follows. ■ 

The rest of the proof now follows along the same lines as Theorem 2.4.1. To begin, we 
have the following proposition which is formally identical to Proposition 2.1.13. 

Proposition 5.1.9. Consider tvwB G-extensions B~^{p)'^ and S+(p)'^. Suppose Q is a G- 
map on such that dist{B+ {p)'^ , Q) = dist{B+{pY, P*'). Then for every rj > 0, £ > 0, there 
exists a G-map Q' on such that 

i) P^cE(Q') 

a) dist{B+{pf,Q') =dist{B+{pY,P^) 
Hi) \Q-Q'\<ri. 

m 

Proof (Theorem 5.1.1'): Imitating the proof of Theorem 2.4.1, we choose, via proposition 
5.1.9, a Cauchy sequence of G-maps {U^} on converging to a G-map U on such that 

dist{B+{pf, U) = dist{B+{pY, P^) and ^(f/) = Y.iP^)- 

Let U = (f/i,^). Since Y.{U) = the factor map p -. 5+(p)^ B+ip)'^ defined by 

p{x, g) = {Ui{x),^{x)g) is an isomorphism, where tJi{x) is the C/i-name of x. Since pT^ = T'^p, 
it follows that 

ip{U,{xmx) = aTx)<f>{x) 

Obviously, Ui defines an automorphism of -B+(p). Hence, p defines a N* x G- isomorphism 
from B+{p)'^ to B+ipf. ■ 

5.2. Some Applications of Theorem 5.1.1 

Example 5.2.1. As an application of Theorem 5.1.1, consider a finite group G of order > 3 
with two distinct generators h and h' (thus G = Z/nZ, for n > 3). Fix a probabihty vector 
p = ipi, ■ ■ ■ ,Ps) with pi = Define a map ip -. /^* ^ G by 



Note that the P'^-tree name of {x,g) is completely determined by P'^{x,g) = {xo,g). 
Let T(j_g) be the common P'^-tree name of all points {x,g) such that Xq = j. Using the 





e otherwise. 
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ideas in §4.1, to prove that 5+(p)'^ is tvwB, it suffices to show that for any two elements 
(J, g) and (j', g') in I x G, we have some node v and some tree automorphism A such that 
'^U,g)i''^) = '''0',s')(^'^)- i^ct, it is enough to show this in the case when j = f = 1. 

Choose A; e N such that g' = hf'g. Then for the node v = (1, 1, 1, . . . , 1) of length /c + 1, 

T(j^g){v) = (1, for the node « = (1, 2, 2, . . . , 2) of length A; + 1, r(j,g')(M) = (1, g). Since these 
two nodes can be matched by a tree automorphism, we have T(j,g)(t') = TQi^gi){Av) for some 
tree automorphism A. Thus, B^{TpY is tvwB. 

In the same way, for the map -0 : I^' —>■ G defined by 



the same argument shows that S+(p)'^ is tvwB. Thus Theorem 5.1.1 shows that S+(p)'^ and 
S+(p)'^ are N* x G-isomorphic. 

Note that the two cocycles and '4> are not cohomologous provided that hh'~^ is a gen- 
erator (if they were cohomologous, then a N* x G-isomorphism between the corresponding 
extensions exists trivially). Indeed, if they were cohomologous, the G-extension obtained 
from the map (j) — ip defined by (0 — 'ip){x) = (j){x)il){x)~'^ would not even be ergodic, contra- 
dicting the above argument which shows that it must be tvwB and hence ergodic. ■ 

The following proposition, as observed by del Junco, gives another interesting application 
of Theorem 5.1.1. We say that a tree-adapted factor map : X — > y is uniformly p-to-1 
if the fiber over a.a.y in Y contains p points each with conditional probability 1/p. One 
example of such a factor map is if X = Y is the one-sided (l/2,l/2)-Bernoulli shift and 
is the addition map defined by (6(.x)j = + Xj+i. The following proposition essentially says 
that in the case when p = (1/2, 1/2), then this is (up to automorphism) the only example. 

Proposition 5.2.2. Suppose X = {X, B, T) e End{p) is tvwB. Let Y = {Y, C, S) be 

a tree adapted factor of X. For any pair of uniformly 2-to-l tree-adapted factor maps 
Hi : X — > Y and Ha : X — > Y, there exist automorphisms p : X — > X and (/? : Y — > Y such 
that II2 o p = (f oUi. 

Proof: We define two (N* x Z2, p)-actions on {X.B.fj) as follows. Let Z2 = {0,1}. Define 
T(i_o) = and 2^(0,0) to be the identity map. Define T^o^i) : X ^ Xhj sending x to the unique 
point x' distinct from x such that ni(x') = ^i{x). The fact that Hi is uniformly 2-to-l implies 
that T(o,i) is an automorphism of X. Using T(i o), 2^(0,0) ^-nd T(o,i), we can define T(„_g) by group 




h' if Zq = 2; 



e otherwise. 
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composition. By tree-adaptedness, it is easy to check that {T^n,g)} defines a (N* x Z2,p)- 
action on X. In a similar way, using 112, we can define a second (N* x Z2,p)-action 
on X. 

By Theorem 5.2.2, we have a N* x Z2-isomorphism p {X, {T(„g)}) — > (X, {S(n,g)})- In 
particular, p defines an automorphism of X. Note that as 5'(o,i) o p = po T(o,i), p maps fibres 
of Hi to those of 112 and so the map (/? : Y ^ Y defined hj = {U2) opo (iii)"^ is well-defined 
and is clearly an isomorphism with the required property. ■ 

Remcirk 5.2.3. It is reasonable to believe that Theorem 5.1.1 should also hold in the case 
when G is a compact metrizable group instead of just a finite group. We were unable to prove 
this, although it seems that the same ideas should apply and is likely a technical extension. 
One possible approach would be to define a partition Q at on G by dividing G into subsets 
of diameter < 1/2^ for each and define tree distributions by a suitable discretization. 

Second, it would be interesting to give a joinings proof of this result, though the definition 
of one-sided joinings might need to be modified to achieve the required isomorphism. 
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Chapter 6: 

Some Open Problems 



This thesis has extended the investigation of the isomorphism problem of non-invertible 
endomorphisms initiated by Hoffman and Rudolph. Obviously, much is left unaddressed. In 
this chapter, we state some problems in this area which warrant further investigation. 

Problem 1. Consider a Lcbcsguc probability space {X,B,fi) with two commuting endo- 
morphisms T and 5* on it, i.e. TS = ST a.c. The basic model is the two-dimensional lattice 
{0, Ij^^^'^^ with product measure (1/2, 1/2)*^'^*^^ and with T and S being the shift maps in 
each direction. Given another Lebesgue space {Y,C, v) with two commuting endomorphisms 
T' and S' on it with the same property as T and S, is there a reasonable way to decide when 
{X, B, ji) and (Y, C, v) are N* x N*-isomorphic, i.e. when can we find a measure preserving 
bijection : X ^ y such that 07 = r'0 and 05 = S'cf) a.e.? 

Problem 2. So far, all endomorphisms X = {X, B, fi,T) considered are homogeneous in 
the sense that for a. a. x, x has the same number of inverse images with the same set of 
conditional probabilities. Obviously, most endomorphisms do not share this property. Can 
one give a reasonable classification for a subset of these? If we have a fixed one-sided Markov 
shift which is not homogeneous, is there some reasonable criterion to ensure that a given 
endomorphism is isomorphic to it? 
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Problem 3. Proposition 5.2.2 in particular implies that there is essentially only one tree- 
adapted uniformly 2-to-l endomorphisms of -B+(p). It would be interesting to classify tree- 
adapted uniformly 3-to-l endomorphisms (and in general p-to-1 endomorphisms) of S+(p). 
The proof of Proposition 5.2.2 cannot be used since even for tree-adapted uniformly 3-to-l 
maps of B~^{p), there is no canonical way of constructing a N* x Za-action on the Bernoulli 
shift space. 
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